Sir:

We, Hongjun Ji and Craig A. Rosen, hereby declare and state as follows: 

1. We are co-inventors of the subject matter described and claimed in the above- 
captioned U.S. Patent application (Exhibit A). 

2. We are informed that the Examiner of the above-captioned application has 
rejected the claims based on the subject matter disclosed in U.S. Patent No. 6,054,289 
(hereinafter "the '289 patent"; Exhibit B). We also understand that the rejected claims are
those provided herewith as Exhibit C and are directed to Breast Cancer Specific Gene 1 
(hereinafter "BCSG1"). We hereby state that we are the sole inventors of the subject matter 
relating to BCSG1 and the claims provided in Exhibit C. 

3. We are co-inventors, along with Paul A. Moore, Reiner L. Gentz, Jian Ni, and 
Jing-Shan Hu, of the subject matter disclosed in the '289 patent. Although Drs. Moore, 
Gentz, Ni, and Hu contributed to the disclosure relating to other genes disclosed in the '289
patent, Drs. Moore, Gentz, Ni, and Hu did not contribute to the disclosure of the '289 patent
relating to HBGBA67X. We hereby state that we are the sole inventors of the '289 patent 
disclosure relating to HBGBA67X. 

4. Additionally, we hereby state that BCSG1 sequences disclosed in the above- 
captioned application and those disclosed as HBGBA67X in the '289 patent correspond to the
same gene and that we are the sole inventors of these disclosures in both the above-captioned
application and the '289 patent. 

5. We declare further that all statements made in this Declaration of our own 
knowledge are true and that all statements made on information and belief are believed to be 
true; and further that these statements are made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 
Section 1001 of Title 18 of the United States Code, and that such willful false statements may 
jeopardize the validity of the application or any patent issuing thereon. 
Breast Cancer Specific Gene 1

The present application is a Continuation-In-Part of Application Number
08/673,284, filed on June 28, 1996, which is herein incorporated by reference;
said 08/673,284 claims the benefit of the filing date of provisional application
60/037,080, filed February 3, 1997, which is also herein incorporated by
reference.



Field of the Invention 



The present invention relates to a novel breast cancer specific marker,
breast cancer specific gene 1 (BCSG1). BCSG1 polypeptides are also provided,
as are vectors, host cells and recombinant methods for producing the same. Also
provided are diagnostic methods for detecting breast cancer. The invention
further provides an isolated BCSG1 polypeptide having an amino acid sequence
encoded by a polynucleotide described herein.

Related Art 

States every year, with incidence increasing by approximately 1% — 
(Goldhirsch, A., et al., JNCI 87:1141-1145 (1995); Ernster, V.L., et al., JAMA 275:913-

918 (1996)). Studies linked to the discovery of new genetic markers and
additional risk factors could provide new information that fits into the complex
patient management issues surrounding breast cancer. Many new prognostic and
predictive factors have been proposed and studied for breast cancer. HER 2/neu 

Oncol. 10:599-605 (1992); Gusterson, B.A., et al., J. Clin. Oncol. 10:1049-56
(1992)). p53 alteration is an indication of poorer prognosis and poor response



,. . in.imo-34a995y,ElledgeR.M., 
totamotfen^erghJ.,^." ^kon^expressron 
- «- ^ poor prognosis in invasive 

ta. an indicative vaiue ^ Ses ^ 25 :l75- 8 7 0993)). 

durtal carcinoma (SteegP.b., aJ>peaIS to 

Cathepsin D, a protease -*-*^ vil(i 270.4S4- 

7 (1W5V, Schena, M, * al, ^ B.D. Hames (ed.). I-ndon: 

LRL Press., 0992), PP- F tionswith factor Vffi antibodies 

(199 4)). Positive—s^goftomor^ ^ ^ 

— — ^rrr 

i8 :l65-98 (1993); Hams A.L., ef a/., * ft ^ been 

*- r 771* <o be an indicator of muiudrug 
Expression of the m*-i •« » (1995); Gasparini O, - 

resistance (Harris A.L., et a., c ■ n 993)». P«> r response to 

-^-^r^rrrrnpU, . 

endocrine therapy has been . 75 ^ ( , 9 95». Alsorecerwng 

^vator/u^bitorCF^A^^ ^ CAlandBR CA2 

^, Y , £ Ml, Sc,-^«^ U 9 J o _ 1M (1904)) . 

2090 0994); Futreal, P.A., e, a/., ft— ^panied by 

Tta. me onset and P»^»^ ve ^ons in 

vegeneticchangesr^ 

Cfc W. ^ «-«- 100 ^ " J^^complementofnovdtranscnbed 

^^^.^i: ^As co-nd provide ciinicaUy usefu, 
^as. .dentification of the* ^ our 

information for patient management and p gn 
, unders^ndingofbreastcancerpamogenesrs. 
cloning methods, such - differential ^£?Z genes ^tiallv

a/., FASEB J. 7.vc» * <<.o 53 7-2541 (1995); Zou, Z., ef a/-. 

0** Zhang, M.- ^Xt ^L have .vowed the 
SciOT ce .03:526-529)). However, these m ^ 

cDNA sequencing of individual genes (Sager, R., * ^ 

Alfcough pathological endpoints such as tumor s,«, *mP 
•c electing treatment stratepes for breasi can 

individual tumor e.g. probability ^ 

• j *m^A few have contributed to denning 
factors have been identified, few nave 

to therapy. 

Summary of the Invention 

-t— rt: 

. — - ri^rrrrdU^^ed 

sequence shown in Figure 1(SEQ ^ ATCC Deposit Number 

by fc cDNA --^^^56 on^23, 

The present invention also relates to recombinant vectors, which include
the isolated nucleic acid molecules of the present invention, and to host cells
containing the recombinant vectors, as well as to methods of making such vectors
and host cells and for using them for production of BCSG1 polypeptides or
peptides by recombinant techniques.

In accordance with another aspect of the present invention, there is
provided a method of and products for diagnosing breast cancer metastases by
detecting an altered level of a polypeptide corresponding to the breast specific
genes of the present invention in a sample derived from a host, whereby an
elevated level of the polypeptide indicates a breast cancer diagnosis.

The present invention further relates to antibodies specific to the
polypeptides of the present invention, which may be employed to detect breast
cancer cells or breast cancer metastasis.

The polynucleotides and polypeptides described herein are useful as
markers for breast cancer.

Brief Description of the Figures 

Figure 1 shows the nucleotide (SEQ ID NO:1) and deduced amino acid
(SEQ ID NO:2) sequences of BCSG1. The protein has a deduced molecular
weight of about 14.2 kDa. The predicted amino acid sequence of the BCSG1
protein is also shown.

Figure 2 shows the differential cDNA sequencing approach. Messenger
RNAs from normal and diseased tissues were extracted and used for making
cDNA libraries. These libraries were searched by the EST method, involving
automated DNA sequence analysis of randomly selected cDNA clones. The
ESTs with overlapping sequences were grouped into unique EST groups. Each
unique EST group, none of which overlaps another in sequence, was
analyzed for its relative expression by examining the number of expressed
individual ESTs in the libraries of normal vs diseased tissues. Three EST groups
are listed. Blue EST group represents gene that is equally expressed in both



vector (SEQ ID NO:10) and the subcloned BSCG-1 cDNA coding sequence. The

Figure 4 shows the nucleotide sequence of the regulatory elements of the
pHE promoter (SEQ ID NO:11). The two lac operator sequences, the
Shine-Dalgarno sequence (S/D), and the terminal HindIII and NdeI restriction sites
(italicized) are indicated.

Detailed Description 

cDNA. The BCSGlprotem of the present m ^ 1(SEQ ID 

i 'A The nucleotide sequence shown mngux 
^ftnuman AD amylcd. The »* ^ on 

N0 ,) was obtained* ^ 1M01 Parklawn 

nl iQQ7 a t the American lype ^ lulc 
January 23, 1997 ft ^ n)mber 97g56 . The 

Drive, RockvUle, Maryland 20852, and * (stratage ne, La 

20852, and given accession number 97175. 
all nucleotide sequences determined by

sequen^(suchas«heModel373f*m^ ^ „ detemilB d herein 

^^ueneesofpoivpep^en^ed^K ^ ^ 

W ere predict by Ration ^ D 0MA seance determined * «• 
Therefore, as is known m the art dstenn i ned hereto may contarn 

^omated approach, any nncieo.de sequen^ _ ^ . 
« errors. Nucleotide sequences , ^ ^ ^ ^ 

least ^n. 90* identical, more typrcally -J- ^ ^ 

99 . 9% identical to the actual nuc^tide «, ^ ^ by ^ 
m „lecule. The acmal sequence can be " ? weU taown in the art 

— T ST or deletion to a determined 
As ^ - — » *• ttiracma, ^nce v.llcansearramestoftm 
nucleotide sequence compared to the ac ^ ^ ^ 

nation - the nucleotide ^L,uen« will - completely 

sequence encoded by a detenmned encoded by the sequenced DNA 

mo ,ecule, beginning at me point of such 
ta Figure 1, a nucleic acid molecule of th 

^tidemaybeob^edustogsun^om^^ 

of me invention, the nuclerc actd molec gene was also 

^d^ta^ B « 4 7 4 *2ri h . deUrmined nucleotide 
identified in cDNA libraries from bram ^ ^ ^ 

sequent theBCSC, ^^^^Ldues, ^ an initiation 
^^r:^— tidesequencein.igureHSBQ^.O,), 
codon at positions 12-l« 



♦ 1 a o VDa. The BCSG1 protein shown 

timer's dise.se (AD) amyloid protein, possibilities of 

* ^r^rv skill would appreciate, due to me 
As one lypeptideencodedbytodeposited 

fencing errors, the BCS ™> be anywhere in the range of 

cDNA comprises about 127 amino acids, but may be jrw 



110-140 amino acids. fA „„« sm t invention may be in 

me fonnofRNA,suchasmBNA,orin ftr nroduced synthetically. The 

a «mi c DNA obtained by cloning or produced syu 
cDN A and genomic DMA ootam c; ne i e -stranded DMA or RNA 

By isoiawu environment. For 

DNA or KNA, wbich has been removed *» *~ ^ 

, . ant nNA molecules contained in a vector ar 
example, recombinant Furth er examples of isolated 

isola ^ for me purposes of mepres^m ^^ 

host cells or purified (partially ™ A transcripts of the DNA 

"^l^Idmolecuies of the present invent* -J-J™ 

N ° :i ^rrr:b:i:--^^^^ 

from those described above but which, due to the degeneracy of the genetic code,
still encode the BCSG1 protein. Of course, the genetic code is well known in the
art. Thus, it would be routine for one skilled in the art to generate such



degenerate variants. 
encoding the BCSG1 -^^-"T^J^U* 97856 on

lanuary 23, 1997 or conned » the ph» P ^ ^ 

97175 on ,une2, 1995. The — n > (SEQ roNO :D ox 

^nucleotide sequence ^ lacktog the N-tenninal 

aeposited Cone, «he full-length BCSGU» ^ „ one 

10 of the above sequences. Suchisoia hybridization wvm 

are useful as probes for 6- > * human tissue, 

^oson.es.andforo^cunge^ress.onof.ieBCSGl 

for „bvN 0 ^b,«an^ tQ ^ rf ^ isolattd 

The present invention is fur* ^ 
15 nndeic acid moleou.es describe, ^* ^ deposited cDNA or Ore 

^d molecule having the Q:1) is tot end=d fragments at 

nucleotide sequence shown in FW» ( ^ ^ prefcrably 

mahout 15 "^^^^atleastabout^ntinlengm^h 
aUeast^On.andevenmorepreferabl^ 

20 — — — lB, S-« ^ 275, 300, 325, 350, 375, 

fragments 50, 75, 100, 125, 150, 175, 20 ^ ^ 

400 , 425 , 4 50,475,500,5^ntm^ to ^ ^ ^ 

present invention as are fragmen^ J ^ figure 1 (SEQ ID 

nucleotide sequence of the deposited DNA 

25 ~* 

^sequence as shown in Figure 1 C*EQ ID 
deposited cDNA or the nucleotide sequence 

NO:l). , „f,h.nresent invention include nucleic 

Preferred nucleic acid fragments of the present invention include nucleic
acid molecules encoding epitope-bearing portions of the BCSG1 protein. In
particular, such nucleic acid fragments of the present invention include nucleic
acid molecules encoding: a polypeptide comprising amino acid residues from
about 94 to about 107 in Figure 1 (SEQ ID NO:2); and a polypeptide comprising
amino acid residues from about 120 to about 127 in Figure 1 (SEQ ID NO:2).
The inventors have determined that the above polypeptide fragments are antigenic
regions of the BCSG1 protein. Methods for determining other such epitope-bearing
portions of the BCSG1 protein are described in detail below.

^^^Itv^onpldesauisolat.dnucleicacidrnolecule 
toanoteaspect.u.e.nv^o P hvbridiz ation 

^ ^^l^Lanncleicacid moleculeof the 
eonditionstoapomonofthepoly to ATCC 

— ircr £ sits- — * — 

r^its 97S56 or 97,75 By g^ ^ ^ ^ 5x ssc 

0 vennghtincubationa,42 <>»■«» phosphate (pH 7.6), 

050 mMKaCl. 15 rnM»v3od.um mM ^ ^ 

5X rearers solution, 10% ""T^OU SSC at about 65'C. 
_r.UA followed by washing the filters inu. ix =■=> 

^ f "L Jl n bvbridizes to a "portion" of a polynucleotide 
By a polynucleotide winch hybn to * lea st about 

1 5 ^ rlTentr^l, about 30-70 nt of the reference 
at least about 30 nt, ana even y ^ &seasxi 

po.ynudeotide. Tnese are useful as dtagnostic probes and p 

above and in more detail below . of ^,^20 n. in length," for example, 

By aportionof ***** 
iS ^ 20 T mO ^r-C2^or.enucleotide^nce 
me reference r^lynucleoti&^ g^ P ^ ^ ^ 

as shown in Ftgure 1 (SEQ ID W > fa .^^^tractofme 
hybridizesortytoapoly A^^as ^ , 

BCSG1 cDNA shown m Figure 1 (SEQ ID ' leotide „ fffi e 
^h of T (or U) resides, wouid no, be - £ ^ 

mention used to hybridize to a portion of a nucletc ac 
v nucleic acid molecule centring

apolyCWswtcfortocomplem^ 

cDKA clone). of ^ present invention wtoch 

As m dica«d, nuc.ee acrd mofcc* ^ ^ Mld 

encode a BCSG! pottpeptide ma, - fa ^ ^^eptide 

^enceof u,e^ptide>y-^ ^ ^ Mid . 

md additional seo.uences, such " * ^w-**-"** 

J„* the aforementioned additional 

SM1 „ence ^^X^S™^^'™^ 
^ seq uence S ,«o i e*e I ™*add I ti ^ , ^^as 

*~M-^~*r££^ signals, for e^e - 
processing, including « f ^ m j^onai coding se^ence v^cn 
Hboso.nebindingand^tvofn^A ^ ^ 

eo.es for additional amino ^.^^b.*-" 
^onahties. » *-*~» . peptide vMch faciUta.es 

^er sequence, such as a ^ ~ ^«*»«""» 
paction of ti^efosed Pc^e , ln~ 

, „f the invention, the marker ammo ac oterSj 
aspectofthemv inao QE vector (Qiagen, Inc.), am » 

peptide.suchasthetagprovtdcdmapQ ^ ^ „ „, w 

9 ^anotherpepude 
anient purification of the «— p~*- ^ ^ 

5 —^^^^^^--0, fused 

(198 4). As discussed below, o 
variant. By an "allelic variant" is intended one of several alternate forms of a
gene occupying a given locus on a chromosome of an organism. Genes II, Lewin, 
B., ed., John Wiley & Sons, New York (1985). Non-naturally occurring variants 
may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions, which may involve one or more nucleotides. The variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the 
coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and 
activities of the BCSG1 protein or portions thereof. Also especially preferred in 
this regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical to 
(a) a nucleotide sequence encoding the BCSG1 polypeptide having the amino
acid sequence in Figure 1 (SEQ ID NO:2); (b) a nucleotide sequence encoding the 
polypeptide having the amino acid sequence in SEQ ID NO:2, but lacking the N- 
terminal methionine; (c) a nucleotide sequence encoding the BCSG1 polypeptide 
having the amino acid sequence encoded by the cDNA clones contained in ATCC 
Deposit Nos. 97856 or 97175; or (d) a nucleotide sequence complementary to any
of the nucleotide sequences in (a), (b) or (c). 

By a polynucleotide having a nucleotide sequence at least, for example,
95% "identical" to a reference nucleotide sequence encoding a BCSG1
polypeptide is intended that the nucleotide sequence of the polynucleotide is
identical to the reference sequence except that the polynucleotide sequence may
include up to five point mutations per each 100 nucleotides of the reference
nucleotide sequence encoding the BCSG1 polypeptide. In other words, to obtain
a polynucleotide having a nucleotide sequence at least 95% identical to a
reference nucleotide sequence, up to 5% of the nucleotides in the reference
sequence may be deleted or substituted with another nucleotide, or a number of
nucleotides up to 5% of the total nucleotides in the reference sequence may be
inserted into the reference sequence. These mutations of the reference sequence
may occur at the 5' or 3' terminal positions of the reference nucleotide sequence
or anywhere between those terminal positions, interspersed either individually
among nucleotides in the reference sequence or in one or more contiguous groups
within the reference sequence.
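
The definition above reduces to simple arithmetic. As a minimal sketch (the function name max_point_mutations is an illustrative assumption and does not appear in the application), the number of point mutations a polynucleotide may carry while remaining at least a given percent identical to a reference of a given length could be computed as follows:

```python
def max_point_mutations(reference_length: int, percent_identity: int) -> int:
    """Largest number of point mutations (substitutions, deletions or
    insertions) compatible with at least `percent_identity` percent identity
    to a reference sequence of `reference_length` nucleotides.

    Hypothetical helper for illustration only.
    """
    if reference_length < 0 or not 0 <= percent_identity <= 100:
        raise ValueError("length must be >= 0 and identity within 0..100")
    # Integer arithmetic avoids floating-point rounding at the threshold.
    return reference_length * (100 - percent_identity) // 100


if __name__ == "__main__":
    # Thresholds recited in the text, applied to a 100-nucleotide reference.
    for pct in (90, 95, 96, 97, 98, 99):
        print(pct, max_point_mutations(100, pct))
```

For a 100-nucleotide reference, the 95% threshold permits five point mutations, which matches the "up to five point mutations per each 100 nucleotides" wording above.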

As a practical matter, whether any particular nucleic acid molecule is at
least 90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide
sequence shown in Figure 1 (SEQ ID NO:1) or to the nucleotide sequence of the
deposited cDNA clone can be determined conventionally using known computer
programs such as the Bestfit program (Wisconsin Sequence Analysis Package,
Version 8 for Unix, Genetics Computer Group, University Research Park, 575
Science Drive, Madison, WI 53711). Bestfit uses the local homology algorithm
of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 (1981),
to find the best segment of homology between two sequences. When using
Bestfit or any other sequence alignment program to determine whether a
particular sequence is, for instance, 95% identical to a reference sequence
according to the present invention, the parameters are set, of course, such that the
percentage of identity is calculated over the full length of the reference nucleotide
sequence and that gaps in homology of up to 5% of the total number of
nucleotides in the reference sequence are allowed.
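
To illustrate what "calculated over the full length of the reference nucleotide sequence" means in practice, the following is a minimal sketch only. It is not the Bestfit program and does not perform alignment; it assumes the two sequences have already been aligned by some program and are supplied as equal-length strings with "-" marking gaps. The function name percent_identity and the gap character are assumptions made for this example.

```python
def percent_identity(ref_aligned: str, query_aligned: str, gap: str = "-") -> float:
    """Percent identity of a query to a reference, measured over the full
    length of the reference, from a precomputed gapped alignment.

    Every aligned column in which the two sequences differ (a substitution,
    a deletion from the reference, or an insertion into it) counts as one
    point mutation. Hypothetical helper for illustration only.
    """
    if len(ref_aligned) != len(query_aligned):
        raise ValueError("aligned sequences must have equal length")
    ref = ref_aligned.upper()
    query = query_aligned.upper()
    # Full reference length = aligned reference positions that are not gaps.
    ref_length = sum(1 for base in ref if base != gap)
    if ref_length == 0:
        raise ValueError("reference contains no residues")
    differences = sum(1 for r, q in zip(ref, query) if r != q)
    return max(0.0, 100.0 * (ref_length - differences) / ref_length)


if __name__ == "__main__":
    # One substitution in a 10-nucleotide reference.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
```

Dividing by the reference length, rather than by the length of the aligned segment, keeps a short but perfect local match from being reported as 100% identical to the whole reference.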

The present application is directed to nucleic acid molecules at least 90%,
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in
Figure 1 (SEQ ID NO:1) or to the nucleic acid sequence of the deposited cDNA,
irrespective of whether they encode a polypeptide having BCSG1 activity. This
is because even where a particular nucleic acid molecule does not encode a
polypeptide having BCSG1 activity, one of skill in the art would still know how
to use the nucleic acid molecule, for instance, as a hybridization probe or a
polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of
include, inler alia, ^^ on ^ e g _ "FISH") to metapbase chromosomal

cD K A UW,( 2 ).^ ^ rf ^ BCSO1 gene , a, 

spreads to pro™ie ^ onm<mes:AMam a, of Basic Tec^. 

describedtaVermaeto!., Human o for d ^ Bg 

Pergamon Press, Ke W York (1988V, and Northern 

By'apolypeptidehavmgBCSGlactmty cgG1 p^in 

^^SIS^- ******* 

of *emventio,,asmeasured m aparUcul ^ ^ „ measured 

beUevedtobemvoWed^thapoptosis. BCSG1 pro ^ 

measure apoptosis. For example, human ore 
using assays that measure ap ^ m OTm0Ut 

cultuI ed on Lab-Tek chamber ^ M Us are then 

^rnoinantBCSGl protein or a candidate ^ h as adriamycin. 

^^^Tb^-irr^l celts .hereon 
Apoptosis is compared between ft ^ foUowmg assay. At 

variousmnepouv.a^theM hmyMole (Boehringer Mannheim) m 

~ ffi "I^T- — — » — ofstainingon 
a l u g/mlmethanol solution. Ceiisare FOTeriments are performed in 

a Zeiss Axiopho, epiflouresence —JJ^Jor condensed 

. ^ * i oct i SO cells scored at each point, tragus 
ripnca* wi* at least. 50 ceUs ^ ^ ^ ^ 

nnclei are scored as apoptotic. Intactor neotordma ryskUl 

"—sizzz numb. — . ~ 

" Mn " r^eas, 90-,, 95%, 96%, 97%, 98%, or 99% 
molecules having a sequence at 






a „r*w A or the nucleic acid 

^ *T ! ^ since degenerate variants of these nucleotide 
BCSG1 protein actrvrty. « be clear to me sldlled artisan 

even without perfoimmg the above ^ are not degenerate 

r ^ tooKart ^fo IS uchnuce K ae«^ ^ 

^inactivity. ^-^^l^e.vtos^ve^p.oteU, 
^bsdtntionsiatatee.therlesshte^ ^ a ^ nd auplrfc amino 

^„n(e.g.,replacingoneal 1 phat 1 cammoao 1 dw 1 

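acid).

For example, guidance concerning how to make phenotypically silent
amino acid substitutions is provided in Bowie, J.U., et al., "Deciphering the
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science
247:1306-1310 (1990), wherein the authors indicate that proteins are surprisingly
tolerant of amino acid substitutions.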



Vectors and Host Cells 



The present invention also relates to vectors which include the isolated
DNA molecules of the present invention, host cells which are genetically
engineered with the recombinant vectors, and the production of BCSG1
polypeptides or fragments thereof by recombinant techniques.

Tne polynucleotides n»y - ^ ^ m . 

precipi r v t if » " * 11 ^ 1)6 ^ in wmi " 

charg ed Uprd. ^^^^^^nsducedlntohostcell, 
appropnate packagrng ceU to ^ promoter> 

The DNA insert should be operatively linked to an appropriate promoter,
such as the phage lambda PL promoter, the E. coli lac, trp
of retroviral LTRs, to name a few.

Otar stable promos wul be too tenniEatio)1 md; 

will «-* — *<«^ P xue coding 

m transcribed region, a ribosome b^hng ^ ^ 

UOA 01 UAO) appropriately po 

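translated.

As indicated, the expression vectors will preferably include at least one
selectable marker. Such markers include dihydrofolate reductase or neomycin
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance
genes for culturing in E. coli and other bacteria. Representative examples of
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells;
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as
CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture
mediums and conditions for the above-described host cells are known in the art.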

toaddiuontomeuseofe^ ^ 
mention, the present invennon turner - ^ 

^prising operator and —^.T— - "* ' ^ * 

^ueuces encoding a protem of merest. 

^Swluchisdescribedinde^bao^ tthepH E4-5 vector 

As summarized in Frgures 3 and 4 co J*> ^ a 

4) wo fcc operator ^ ^gin of repUeation (oriC) is delved from 
pU C19 (LTI, Gaithersburg, MD). Tne P ^ „ weU 
, = , ilto CA 94303. A nucleotide sequence encoding
East Meadow Circle, Palo Al«o, CA 94 0^ ^ ^ fey 

BSCG-1 (SEQ IDNO-.l), is operatrvelv ste of ^ 

^rung the nudeotide sequence between the Kdel and Asp 

pH E4-5 vector a ^ geM . ^ is an 

As no«ed above, *ep^ ^^r. Amann, 
allele of the telgenewtach confers tightregm , 81) . m;acI<1 

, ^ *0-™i-315 0988);Stark,M.,Ge>>e5J.255-267U*> J 

E ' " 7 '^r proi which binds to ^ operator sequences and 
gene encodes a repressor pro ^ ^ fadq geM 

H ock S «ran S c ri ptionofd„™-s«rea I n(.e.,3)se4ue ^ „ 

ee^ lactose ■^^'J^^,.------' 

BSC0 ' 1 te^r^ mdLnof.esebostceUsbvu.additionof . 
containing 1he pHb40 vai BSCG-1 coding 

ag e», such as IPTO, however, results in the expresston of the B 

sequence. rxrpA-5 vector (SEQ ID NO:l 1) 

The promoter/operator sequences of the pHE4 Svl ^ 

• „ TSohaee promoter and two lac operator sequences, on p 
compnseaTSphagepr other „ locat ed 3' to the same 

site. These operators, when present m co 

c^^-*^^ « sequences located 

„ TPTG Expression of operauveiy 
inducer, e.g., IPTG. * P of a lac 

— ~ o^cl to me «, Pro- 

operon inducer, such as IPTG. »ui S & rf 

Lts m their release from me fc* operator rf gene 

• „,n P wp.rt in Devlm, T., iexibwn 
expression is reviewed in ^ 

ThepHmserie,ofvec to rscontama,.rfme-np ^ 
vector except for the BSCG-1 coding sequence. Fean*es P 
^iude optima synthetic T5 phage promoter, lac operator, 
fl.es, sequences are also optimally spaced so that
Delagarno sequences. Furfcer,meseseq ^ rf 

expression of an inserted gene may be ^ "gul 

expression occurs upon induction. production of 

^promote, Suitable euVaryotic promoters tncl^e * « ^ 

A. HSV thymidine kinase promoter, the early 
promoter, the HSV thym. as mose of the Rous Sarcoma 

promoters^promotersofretio^L^,^ 

Virus(RSV>,andmetaUothionempromoters,such 

Pr ° m0t ^ HEASvectoralso — aSrdne-DelgamoseouetrceS'toti.e 

' d . ^L-Delgarno sequences are short sequences genercll, 
AUG initiation codon. Staneueg ,„ tom me A UO initiation codon. 

These sequences essentially direct proKar* 

codon - • • ic^rectedtoexpressionvectorusefulfor 

„epresentinventio J,. This aspect of the 

mention is exempted b M« pQE70 _ ^ ^ 

^ng^rspreferredforusem ^ 

pQE-9, available from Qiagen^pBS ava ilable from Stratagene; and 

vectors, pNHSA, pNH16a, pNHlSA, pN ptemacia. 

Among preferred eukaxyotic vectors ar p pBPV> pMSG and pSVL 

^ pSO avaUable from ^"^^^ apparent to me 
avauablefromPharmacia. Other surtabl. vectors w, 
protein, and - y include - -J^Jli^-.-*

fce^logousfuncnonalregions. FormsUn rf ^ 

pearly charged amino acid,, may be » ^ 

polvpeptide to hnprove P— » ^ pepti de moieties 

^beaddedtothepolypepttdetofac^ P ^addition of peptide 

rem oved prior - final preparation of the to ^rove stability 

~**^»~~*zrz. i»i— 

and to facilitate purification, among others, are ^ 
..ear, ^*'^1^»* 0 * 
533 (Canadian counterpart 204586V) molecules together with another 

portions of constant region of immunoglobffl ^^^^^ ^ fusion protein is 

thoroughly advantageous for use m the py Qn teote 

— -^rZZZZZZZ- ^cpartafterthe 
^for some uses ttwouldbed^ ^ ^ 

fusion protem has been expressed, detected P to< ^ to 

j -vj This is the case when Fc portion proves to oe a 
«— ^ for example when the fusion protein is «o be used 

-^---^tlS-o very.forexample , human proteins, 
^^genforinunun— ■ ^ ^ ^ ^ ^ q{ 

-h as, h^S-receptoth-^ ^ 
high-throughput screening assays ^ ^ R 

Bennett e> a/., Joan* of Molecular Recogn.no < ■ 

270, No. 

Chanson e, aL. THc Journal of 



16:9459-9471 (1995). 






W Ion -»~ chromatography, 

prcoip i«ion, acid ex^ouon, mKraction cbromatograpr*. 

^ .hromatography, ^^^ceU^do— graphy 
chromatography. Most preferably, ^ P rf ^ present invents 

("HPLC) « employed for p— on .? JP 

L^naruraUyP^P^^^ ^ a photic or 
^ pro duc« produced by yeast , bigber ^ and 

^odc^^ tT bo. e^ed in a recombinant 
«Us. i me presalt invention may be 

pro duction procedure, me p*»£~ » ^aes of the 

ere^insomecases 

inventionmay also mcluoe an 
^aresultofhost-mediatedprocesses. 

^on of me above ^ so me amino acid sequences of me 

, t „iU be recogmzed m the art ^ rf ^ ot 

BCSOl polypeptide can be «— - " 

^on of me protein. ^ on me protein W hicb 

.bould be remembered mat mere wrU be cnu 
w^at* and type substitutions. As
mcto de deletions, insertions, inverse ^ 

■^^.^^^^ ^^^ the 
phenol sUen, can be s— ««. 

Message in Protein Sequences: Tolerance to Amur 

2 ,,,306-131C , (WW rf ^ „ {Fipjre , 

substituted amino acid residue ma, or ^ 
a substituen, group, or (p) one m ^ 

— •^ lm ;.f 2°L ( I O PC fusion region peptide or leader or 
te5 d to the poiypepude, ^ au * ^ fc purificiti on of tie 

secretory sequence or a sequence * 

are deemed to be within the scope of those slailea 

teKta ' Of particuiar interest are substitutions of chafed amino acids wKh 
The latter results in proteins wi aagreg ation is highly 

can be immunogenic, (Pinckara . Therapeutic 

t . ?*.sttR 845 f 1987); Cleland ef al Crit. Kev. i ner F 
Robbins et al, Diabetes 56.838-845 

Drug Carrier Systems i0:307-377 (1993)). 
As indicated, changes are preferably of a minor nature, such as conservative amino
acid substitutions that do not significantly affect the folding or activity of the
protein (see Table 1).


TABLE 1. Conservative Amino Acid Substitutions.

Aromatic        Phenylalanine
                Tryptophan
                Tyrosine

Hydrophobic     Leucine
                Isoleucine
                Valine

Polar           Glutamine
                Asparagine

Basic           Arginine
                Lysine
                Histidine

Acidic          Aspartic Acid
                Glutamic Acid

Small           Alanine
                Serine
                Threonine
                Methionine
                Glycine
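
The groupings of Table 1 can be applied mechanically. The following is a minimal sketch, assuming that a substitution is treated as conservative when the original and replacement residues fall in the same Table 1 group; the names CONSERVATIVE_GROUPS, substitution_group and is_conservative are illustrative assumptions and are not part of the application.

```python
from typing import Optional

# Groups transcribed from Table 1 (Conservative Amino Acid Substitutions).
CONSERVATIVE_GROUPS = {
    "Aromatic": {"Phenylalanine", "Tryptophan", "Tyrosine"},
    "Hydrophobic": {"Leucine", "Isoleucine", "Valine"},
    "Polar": {"Glutamine", "Asparagine"},
    "Basic": {"Arginine", "Lysine", "Histidine"},
    "Acidic": {"Aspartic Acid", "Glutamic Acid"},
    "Small": {"Alanine", "Serine", "Threonine", "Methionine", "Glycine"},
}


def substitution_group(residue: str) -> Optional[str]:
    """Return the Table 1 group containing `residue`, or None if the residue
    is not listed in Table 1."""
    for group, members in CONSERVATIVE_GROUPS.items():
        if residue in members:
            return group
    return None


def is_conservative(original: str, replacement: str) -> bool:
    """True when both residues are listed in the same Table 1 group.

    Illustrative assumption: same group means conservative. Residues absent
    from Table 1 are never reported as conservative.
    """
    group = substitution_group(original)
    return group is not None and group == substitution_group(replacement)


if __name__ == "__main__":
    print(is_conservative("Leucine", "Valine"))
    print(is_conservative("Leucine", "Lysine"))
```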









^ depends on »any — ■ X^ZT^ - - 

essetttial forfun«ioncanbe 1 den ti fiedbvm and Weils, 

a^mutagenesisoralanine-— single alanine 

Sci er,«2*M081-1085U989». The.aner P 

mutant molecules are 

r^ons atevery residue in «he molecule. TtoJ S ^ ^ 

^Uferaave activity. Sites that are cnttcal for hgand 
• rh as crystallization, nuclear magnetic

resonance ot photoarnnKy n a992 )). 

(199 2) and de Vos « * S*-» 255 ' 3 ° on ^ preferably provided in an 

^fonn. By p^ced and/or contained vAtinn a 

^o^hostceUisconside^vso^^ ^ tave ^ 

^ intended as an ^^^l^nostceno, sanative 

2T- W— • °- 57 ' 31 ^Loninciude*ec<, l ypep ti deencoded 
^poiypeptideso^epresentmv^ ^ ^ , „ ^ 

by^eaepo^tedcDKA-.apciypepttde^^ mprising ^o acids about 2 

minSBQID^^^apo^ ^ ^ are at least BOS 

^cai^orep^yati^^^ enK)ded b y the 

least 9 6%, 97%, 9S% - ^.^l^usEQIDNO^andaisoincinde 
^^•T^rl rsoLnoacidsandtnotep^iy 
portions of such polypepOdeswrm 

* least 50 amino acids. ^ least , for example, 

By apclypeptidehavinganan^o^ csqi is 

^^cai^-^ceaminoactds^ ^ ^ „ ^ 

intended tha, «- -*» - ^l^s^nle raay include up to five 
5 referencese^-^^ 

^BCSmpolypeptide. ta ^^^ an Ioacidse q uence,up»5% 
^^nceat^^idenncai-^ yte4 ^ orM ^ 

^another anuno acid, oranumber of anun 



acid residues in the reference sequence may be inserted into the reference
sequence. These alterations of the reference sequence may occur at the amino or
carboxy terminal positions of the reference amino acid sequence or anywhere
between those terminal positions, interspersed either individually among residues
in the reference sequence or in one or more contiguous groups within the
reference sequence.

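shown in Figure 1 (SEQ ID NO:2) or to the amino acid sequence encoded by
the deposited cDNA clone can be determined conventionally using known computer
programs such as the Bestfit program (Wisconsin Sequence Analysis Package,
Version 8 for Unix, Genetics Computer Group, University Research Park, 575
Science Drive, Madison, WI 53711). When using Bestfit or any other sequence
alignment program to determine whether a particular sequence is, for instance,
95% identical to a reference sequence according to the present invention, the
parameters are set, of course, such that the percentage of identity is calculated
over the full length of the reference amino acid sequence and that gaps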
-r:rrr::~a P e,de 0 r po^ 
======== ===» 

immunogen. On the omer nan * number of 

« • Aa(int >A « an "antigenic epitope. lne numuc 

siii^— — — — * 
epitopes. See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998-
4002 (1983).

As to the selection of peptides or polypeptides bearing an antigenic 
epitope (i.e., that contain a region of a protein molecule to which an antibody can 
bind), it is well known in the art that relatively short synthetic peptides that
mimic part of a protein sequence are routinely capable of eliciting an antiserum
that reacts with the partially mimicked protein. See, for instance, Sutcliffe, J. G.,
Shinnick, T. M., Green, N. and Lerner, R.A. (1983) Antibodies that react with
predetermined sites on proteins. Science 219:660-666. Peptides capable of
eliciting protein-reactive sera are frequently represented in the primary sequence 
of a protein, can be characterized by a set of simple chemical rules, and are 
confined neither to immunodominant regions of intact proteins (i.e., 
immunogenic epitopes) nor to the amino or carboxyl terminals. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. See, for instance, Wilson et al., Cell
37:767-778 (1984) at 777.

Antigenic epitope-bearing peptides and polypeptides of the invention 
preferably contain a sequence of at least seven, more preferably at least nine and 
most preferably between about 15 and about 30 amino acids
contained within the amino acid sequence of a polypeptide of the invention. 

Non-limiting examples of antigenic polypeptides or peptides that can be 
used to generate BCSGl-specific antibodies include: a polypeptide comprising 
amino acid residues from about 94 to about 107 in Figure 1 (SEQ ID NO:2); and a
polypeptide comprising amino acid residues from about 120 to about 127 in
Figure 1 (SEQ ID NO:2). As indicated above, the inventors have determined that
the above polypeptide fragments are antigenic regions of the BCSG1 protein. 
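
The residue ranges recited above can be read as 1-based, inclusive positions in SEQ ID NO:2. As a minimal sketch of how such regions might be cut out of a protein sequence and checked against the preferred minimum epitope length of seven residues, consider the following. The sequence used here is a synthetic placeholder of 127 residues and is not the BCSG1 sequence; the names EPITOPE_RANGES, MIN_EPITOPE_LENGTH and extract_region are assumptions made for this example.

```python
# 1-based inclusive residue ranges recited in the text for SEQ ID NO:2.
EPITOPE_RANGES = [(94, 107), (120, 127)]

# Preferred minimum length of an antigenic epitope-bearing peptide per the text.
MIN_EPITOPE_LENGTH = 7


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues `start`..`end` (1-based, inclusive) of `sequence`.

    Hypothetical helper for illustration only.
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range falls outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Placeholder only: a synthetic 127-residue string, NOT the BCSG1 protein.
    alphabet = "ACDEFGHIKLMNPQRSTVWY"
    placeholder = "".join(alphabet[i % len(alphabet)] for i in range(127))
    for start, end in EPITOPE_RANGES:
        peptide = extract_region(placeholder, start, end)
        print(start, end, len(peptide), len(peptide) >= MIN_EPITOPE_LENGTH)
```

Under this reading the two recited regions span 14 and 8 residues respectively, so both satisfy the seven-residue minimum.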

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means. Houghten, R. A. (1985) General method 
for the rapid solid-phase synthesis of large numbers of peptides: specificity of 
antigen-antibody interaction at the level of individual amino acids. Proc. Natl. 
Acad. Sci. USA 82:5131-5135. This "Simultaneous Multiple Peptide Synthesis 
(SMPS)" process is further described in U.S. Patent No. [...]. 

As one of skill in the art will appreciate, BCSG1 [...] can 
be combined with parts of the constant domain of immunoglobulins (IgG) [...] 
purification [...] domains of the human CD4-polypeptide and 
various domains of [...] 
mammalian immunoglobulins (EPA 394,827; Traunecker et al., Nature 331:84- 
86 (1988)). Fusion proteins that [...] 
the IgG part can also be more efficient in binding and neutralizing other 
molecules than the monomeric BCSG1 protein or protein fragment alone 
(Fountoulakis et al., J. Biochem. 270:3958-3964 (1995)). 

Cancer Diagnosis and Prognosis 

There are two classes of genes affecting tumor development. Genes 
influencing the cancer phenotype that act directly as a result of changes (e.g., 
[...]) at the DNA level, such as BRCA1, BRCA2 and p53, are called Class I 
genes. [...] 

Development of breast cancer and subsequent malignant progression is 
associated with alterations of a variety of genes of both classes. [...] 
gland, if sufficiently characterized, may yield novel molecular markers which 
may be useful in the diagnosis and treatment of human breast cancer. 

The present inventors have identified a new breast cancer marker that is 
overexpressed in advanced infiltrating breast cancer cells [...] 
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v. „„ breast eoiteMceUs and a weak expression in 
of BCSG1 innormalorbemgnbreaa ' rf BCSG1 indicates 

io.srade.^-inon^su,^- 

^ cancer maUgnant J f ^ proliferation because 

breast lesions. (See, Example 7). ement md treatment of breast 

BCSOln^vbe^inclm,™^ 

^ inthisregard.^eexpress-nof BCS^ ^ ^ m 

neoplastic epical cells of inf— ^ ^ 

wlls „f norma, and benign breast tissue^ ^ ^ ^ 

.evelexpression inflow gradem^ md ^ 

BCSG1 expression is assoctated w«oe of human breast cancer, 

^.emoreadvancedin^ive/me^^ ^ 

This implication is further supported * detec^ ^ ^ ^ 

^breastcancer celllmesdenvc^mp-a. ^ ^ ^ ^ ^ , 
6).BCSGloverexpressionmdu ^ a marked increase in 

^t progression^ V . L „ „ ,, WU 
DCIS incidence beginning in the y , mberofT X;iS cases in the United 
275 :913-9 18 (1996)). The total estimated 

— ^^^Cr^^ci^. 

^ , 973 and 1983 (Emster, V.L., < . rf ^ ^ m 

is currently unknown. There is ca of screentag mammography, 

ca ses*at are being diagnosed - ■ 

most of which are treated by some form of surgry ^ ^ ^ 

XO« 275:913-91* 0996 > BCSG1 exp ^ ^ ^ ^ 

, information on distmguishmg the DCIS w 
some inappropriate or unnecessary mastectomies. In addition, the use of BCSG1 
gene could be of great importance in differentiating atypical proliferative breast 
abnormalities. 
It is interesting to note that the predicted amino acid sequence of BCSG1 
[...] 
of Alzheimer's disease (AD) amyloid precursor protein (Ueda, K., et al., Proc. 
Natl. Acad. Sci. USA 90(23):11282-6 (1993)). A neuropathological hallmark of 
AD is a widespread amyloid deposition resulting from beta-amyloid precursor 
proteins (beta APPs). Beta APPs are large membrane-spanning proteins that 
either give rise to the beta A4 peptide (Aβ fragment) ([...] Proc. 
Natl. Acad. Sci. USA 82:4245-4249 (1985)) or a non-Aβ component of AD 
amyloid (Ueda, K., et al., Proc. Natl. Acad. Sci. USA 90(23):11282-6 (1993)) 
that is either deposited in AD amyloid plaques or yields soluble forms. While 
the insoluble membrane-bound AD amyloid [...] 
thus renders cells vulnerable to excitotoxic conditions of calcium influx resulting 
from energy deprivation or overexcitation (Mattson, M.P., et al., Ann. N.Y. Acad. 
Sci. 679:1-21 (1993)), the soluble AD amyloid proteins are neuroprotective against 
[...] their [...] neuronal calcium [...] 
(1995)). It is possible that BCSG1, like soluble AD amyloid, may be potentially 
involved in tissue damage resulting from tissue remodeling due to the local 
cancer invasion. Nevertheless, Examples 6 and 7 demonstrate a stage-specific 
BCSG1 expression and an association of BCSG1 overexpression with clinical 
aggressiveness of breast cancers. BCSG1 overexpression may indicate breast 
cancer malignant progression from benign breast or low grade in situ carcinoma 
to the highly infiltrating carcinoma. 

The Examples demonstrate that certain tissues in mammals with cancer 
[...] the BCSG1 protein when compared to a corresponding "standard" [...] 
a mammal of the same species not having the cancer. Further, it is believed that 
enhanced levels of the BCSG1 protein can be detected in certain body fluids (e.g., 
sera, plasma, urine, and spinal fluid) from mammals with cancer when compared 
to sera from mammals of the same species not having the cancer. Thus, the 
invention provides a diagnostic method useful during tumor diagnosis, which 
involves assaying the expression level of the gene encoding the BCSG1 protein 
in mammalian cells or body fluid and comparing the gene expression level with 
a standard BCSG1 gene expression level, whereby an increase in the gene 
expression level over the standard is indicative of certain tumors. 

Where a tumor diagnosis has already been made according to 
conventional methods, the present invention is useful as a prognostic indicator, 
whereby patients exhibiting enhanced BCSG1 gene expression will experience 
a worse clinical outcome relative to patients expressing the gene at a lower level. 

By "assaying the expression level of the gene encoding the BCSG1 
protein" is intended qualitatively or quantitatively measuring or estimating the 
level of the BCSG1 protein or the level of the mRNA encoding the BCSG1 
protein in a first biological sample either directly (e.g., by determining or 
estimating absolute protein level or mRNA level) or relatively (e.g., by comparing 
to the BCSG1 protein level or mRNA level in a second biological sample). 

Preferably, the BCSG1 protein level or mRNA level in the first biological 
sample is measured or estimated and compared to a standard BCSG1 protein level 
or mRNA level, the standard being taken from a second biological sample 
obtained from an individual not having the cancer. As will be appreciated in the 
art, once a standard BCSG1 protein level or mRNA level is known, it can be used 
repeatedly as a standard for comparison. 
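
The comparison of a sample against a reusable standard can be sketched as follows. This is only an illustration of the logic, not the assay itself: the function names, the arbitrary units, and the `min_fold` cutoff are assumptions introduced here, since the text requires only "an increase in the gene expression level over the standard".

```python
# Minimal sketch: compare a measured BCSG1 level with a stored standard.
# Units are arbitrary; both levels must be measured the same way.


def fold_over_standard(sample_level: float, standard_level: float) -> float:
    """Ratio of the sample level to the standard level."""
    if standard_level <= 0:
        raise ValueError("standard level must be positive")
    if sample_level < 0:
        raise ValueError("sample level cannot be negative")
    return sample_level / standard_level


def is_elevated(sample_level: float, standard_level: float,
                min_fold: float = 1.0) -> bool:
    """True if the sample exceeds the standard by more than `min_fold`.

    min_fold=1.0 means "any increase"; a stricter cutoff is an assumption
    the user would have to justify experimentally.
    """
    return fold_over_standard(sample_level, standard_level) > min_fold


STANDARD = 10.0  # hypothetical level from an individual not having the cancer

print(is_elevated(35.0, STANDARD))  # sample above the standard
print(is_elevated(8.0, STANDARD))   # sample below the standard
```

Because the standard is a single stored value, it can be reused across samples, which mirrors the remark that a known standard level can be used repeatedly.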

By "biological sample" is intended any biological sample obtained from 
an individual, cell line, tissue culture, or other source which contains BCSG1 
protein or mRNA. Biological samples include mammalian body fluids (such as 
sera, plasma, urine, synovial fluid and spinal fluid) which contain secreted mature 
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bcso1 ^ - — h - P ^ ^ 

tt ^ — — ■ — ~ - ** ^ p,gs ' 

Tota l cellular KNA can be isolated from ■ „ 

nuclease mappmgffuj.ta era ^ ^ poJ^n— » -ham 

^onCPCRXreverse^u-scnpt.onmccmbma ^ 

(ut PCR1 (Makino et ah, Technique 2.295-301 » 
reaction (KT-PCRUM - t h the liease chain reaction (RT-LCR). 
Option in combination with the hgase ch ^ 
. „ R r S Qlorotein levels in a biological sample 
Assaymg BCSG1 pro expression in tissues 

antibody-basedtechniques. For exan.pl. BCSOlP^ 
eanhestudied^thciassical^olustolog.calm^ C ^ 

j- C* Bio/. i<-/-976-985 (1985); Jalkanen, M„ e, al., 

,05:3087-3096 (1987)^ _ ^ ^ detecting BCSG1 protein gene 

Oto antibody Wmethodsu*M ^^rbea, 
expression include immunoassays, such as the enzym 

Suitable labels are kno^im me art ^ 
Glucose oxidase, and radioisotope*, such as icdme ( ^ ^ 

such as fluorescein and rhodamine, and biotm. 
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Chromosome Assays 






* ^r, «.r*» also valviable for 

.!„„ The sequence is specincauy "us 
chromosome identrficatron. The s q chromos ome. The 

hybridto with a parucular to— - present Mention is - 

important first step m correlating 

disease. ^ regard , the cDNA herein 

fc certain Purred ; B ^ m ^gene. Thiscanbe 

di ^isu S ed«oe 10 negeno n uoDNA^^ ra ^ ^ ^ 

accomplished using a vanety of w to is used for to «M< 

^rally - avaUable — «• *" ^ 
chromosome mapping using well Kno chromosomes by 

^^^^Vglis^-rapidiyselectprimersunatdo 
— — genomic OKA, thus complicating the 
«* - ™" ^ "p^J^nused^PCKscreemngof somanc 
amplification process. These prim 

Jh^dscon^inuividualhumauc^mo^e^ ^ ^ ^ ^ 

Fluo rescence , - k f ~ ^ ro i a pre cise chromosomai 
m etaphase chromosoma, spread can W« ^P ^ ^ ^ 

locanoni nones*p. 7^^, see Vermae, a/., 

short as 50 or 60 bp. For a revew of ft. ^ York 

(1988)- J .„ a nrecisechromosomallocation,the 
On-^-^te —me can be correrated with 
physical position of the sequence on 

^encmapda* --"^^^B^W"**-* 

* *"* wallable °° u " Benes and diseases that have been 
Medical Library. The relationship between genes an 
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^ — —~ «— — 

memutationisli^ 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way 
of illustration and are not intended as limiting. 

Examples 

The bacterial expression vector pQE9 ([...]) is used for bacterial 
expression in this example. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311). pQE9 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome 
binding site ("RBS"), six codons encoding histidine residues that allow affinity 
[...] sold by QIAGEN, Inc., [...]. 
These elements are arranged such that an [...] 
expresses [...] six His residues (i.e., a "6 X His tag") [...]. 

The DNA sequence encoding the desired portion of the BCSG1 protein [...] 
which anneal to the amino [...] 
BCSG1 protein and to sequences in the deposited construct 3' to the cDNA 
[...] sequence. Additional nucleotides containing restriction sites [...] 
cloning in the pQE9 vector are added to the 5' and 3' primer sequences, 
respectively. 

For cloning the mature protein, the 5' primer has the sequence 5' 
GG[...]ATGTTTTCAAGAAGG 3' (SEQ ID NO:3) containing the [...] 
the amino terminal coding sequence of the BCSG1 sequence in Figure 1. One of 
ordinary skill in the art would appreciate, of course, that the point in the protein 
coding sequence where the 5' primer begins may be varied to amplify a DNA 
segment encoding any desired portion of the complete BCSG1 protein shorter or 
longer than the protein. The 3' primer has the sequence 5' 
GGAAGCTTCTAGTCTCCCCCACTCTGG 3' (SEQ ID NO:4) containing the 
underlined HindIII restriction site followed by 19 nucleotides complementary to 
the non-coding sequence of the BCSG1 DNA sequence in Figure 1. 

The amplified BCSG1 DNA fragment and the vector pQE9 are digested 
with BamHI/HindIII and the digested DNAs are then ligated together. Insertion 
of the BCSG1 DNA into the restricted pQE9 vector places the BCSG1 protein 
coding region downstream from the IPTG-inducible promoter and in-frame with 
an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E. coli cells using 
standard procedures such as those described in Sambrook et al., Molecular 
Cloning: a Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY (1989). E. coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers 
kanamycin resistance ("Kanr"), is used in carrying out the illustrative example 
described herein. This strain, which is only one of many that are suitable for 
[...] 
Transformants are identified by their ability to grow on LB plates in the presence 
of ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies 
and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
DNA sequencing. 
Clones containing [...] (100 µg/ml) and 
kanamycin (25 µg/ml). The [...] culture is used to inoculate a large culture, at a 
dilution of approximately 1:25 to 1:250. The cells are grown to an optical density 
at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-β-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM 
to induce transcription from the lac repressor sensitive promoter, by inactivating 
the lacI repressor. Cells subsequently are incubated further for 3 to 4 hours. 
Cells then are harvested by [...]. 

The cells are then stirred for 3-4 hours at 4°C in 6 M guanidine-HCl, pH 
8. The cell debris [...] 
BCSG1 is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin 
[...] (available from QIAGEN, Inc., supra). [...] 
procedure (for details see: The QIAexpressionist, 1995, QIAGEN, Inc., [...]). 
[...] volumes of 6 M guanidine-HCl pH 6, and finally the BCSG1 [...] 

[...] buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 
[...]. Alternatively, the protein [...] immobilized on the 
Ni-NTA column. The recommended conditions are as follows: renature using 
a linear 6M-1M urea gradient [...] 
pH 7.4, containing protease inhibitors. The renaturation should be performed 
over a period of 1.5 hours or more. After renaturation the proteins can be eluted 
by the addition of 250 mM imidazole. Imidazole is removed by a final 
dialyzing step against PBS or 50 mM [...] 
NaCl. The purified protein is stored at 4°C or frozen at -80°C. 
Example 2: Cloning and Expression of BCSG1 protein in a Baculovirus 
Expression System 
BCSG1 protein, using a baculovirus leaa 

C*«»«-~ „.yhedri„ promoter 

" ^ Xba I and Asp718. The polygon sit. of 

simian virus 40 ( SV40 ) i ^.galactosidase gen, 

selection of recombinant vuus, the plasmid eon 
^.^und.conWofa^^^— - 
Mowed by the po.yadeny.aUon of u,ep*y ^ ^ 

to cloned polynudeot.de. ^ v<jctM 

Many other baculovirus vectors could be used « p 

J a,173 r»VL941 and pAcIMl, as one stalled m the art wo 
^e, such as pArf ^ ^ sigoal 

readily appreciate, as long as the ^.^asigaal peptide and 

-—-—^r ^r described, for instance, in 
an in-frame AUG as required. Such veer 

Luckow**;.. Virology 17031-19. d ittd ck>n e 

The cDN A sequence encoding the BCSG1 pr 
show n in Figure . (SBQ ID HO*), is ampUfied - gPCR oligo 
prfa iers corresponding to the S and sequences o *e ^ 
T v . 5> primer bas n 
qqqq^^QcGATGTTTTCAAGAAGG 3' (SEQ TC> NO:5) (the lowercase 
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BamHl — = - «^ ^^^protem. 
BCSO. protein shown.nF.gure '^^OJCTAOICrCOOOCACICrQO 
^ y primer ha, the sequence 5 ««^rAffi fey 

3(!>l*!UJi .iw-Vnoncoding sequence in Figure 1. 

Nucleotides complement <o*e 3 , ^ 

agarose gel. This fragment is designated herein Fl . 

Lriasmidisages^wthmerestrtcflonen^esBamHI/AspTlS 

Jolla, Ca.). This vector DNA is designated ^herein 

FragmentFlandthedephosphorylatedplasrmdVlare g 

' ,. -r,„HHB101 or other suitable E. coli hosts such as XL-1 

^ T4 DNA hgase. E col, HB101 or ^fonned with me 

Blue (Stratagene Cloning Systems, La oUa, CAW* ^ 
ligation mixture and spread on culture plates. method, in which one 

^piasmid^mehumanBCSG! ^^^^^ 
of me prm^ that is usedto amplify me gene andme second? 

fragment will show amplified of is designated herein 

fragment is confirmed by DNA sequencing. This plasm. 

^dt^ONAandS.go^epla^idpBacBCSOiarem.edina 
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• • ,,i r.f Qftrum-free Grace's medium 
11 «f * microtiter plate containing 50 ul ot serum ire* 

(Life Technology for ^ ^ room 
90 nl Grace's medium are added, mixed and in 

Thpn the transfection mixture is added drop-wise to ar* 

temperature. Then the transiec Grace's 

/ ATrr CRL 1711) seeded in a 35 mm tissue culture plate witn 

(ATCC CRL 1711) ^ forth to ^ ^ newly 

medium without serum. The plate is roc 

, , , „ The olate is then incubated for 5 hours at 27 C.AlterDno 
added solution. Theplateisth ^ ^ 

^ Q™ith suwa An agarose gel with 
~ mpA « described by Summers and Smith, supra, w s 
performed, as descno r ^ hersb um) is used to allow easy 

-mi - r Ta i« fLife Technologies Inc., Gaithersburgj 

1 * ,^ t u t v iet iDofamicropipettor(e.g.,Eppendorf). The 

* . rw„ t cfo p^Uq seeded in 35 mm aisnes. rvu± 

days later the supematants of these culture ox 

« . A o P The recombinant virus is called V-BCSG1. 
stored at 4 C. The recom ^ ^ ^ m 

To verify the expression of the B^aui g«= 

^ J - - - —j * . 

,.MOI«)cfabou. 2 . Sixhoursl^^meaium.sremov^and^^ 
( SFW n meditnn tninus methionine and cy*eine (avaUable from Ufc 
T, • Tnc Rockville MD). Ifn^olabeled proteins are desired, 42 hours 

Technolog.es Inc., RockvUle, W ^ ^ 

later, 5 nCi of "S-methioiune and 5 |iCi b cystein 






M a The cells are ntrtor tocubaKd for 16 hour, and then toy are 
wadded. TheceUsare » »dl - ft. 

ha^ested by centrifugal ^ J ^ by ^^phy 

^Uu.arpro^ar.ana.yzedby SDS P^E ^ ^ ^ 

Of-*-** to amino — 

terminus of purified protern may be W ^ ^ 

sequence of to mature protern and thus to cleavag 

secretory signal peptide. 

A typical mammalian expression vector contains the promoter element, 
which mediates the initiation of transcription of mRNA, the protein coding 
sequence, and signals required for the termination of transcription and 
polyadenylation of the transcript. Additional elements include enhancers, Kozak 
sequences and intervening sequences flanked by donor and acceptor sites for 
RNA splicing. Highly efficient transcription can be achieved with the early and 
late promoters from SV40, the long terminal repeats (LTRs) from retroviruses, 
e.g., RSV, HTLV I, HIV I and the early promoter of the cytomegalovirus (CMV). 
However, cellular elements can also be used (e.g., [...] promoter). 
[...] include, for 
example, vectors such as pSVL and [...] ([...], Uppsala, Sweden), 
[...] and pBC12MI (ATCC 
67109). Mammalian host cells that could be used include [...] 
and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV 1, quail 
QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) cells. 

Mternatively, to gene -^^^ ^ a sel ec»b,e 
isolation of the transfected cells. 
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^egeneof^res. A.o.be.useMseUc _ 279(i99i);Bebbingtone((li ., 
^(OS^urphy^ 

«>' 169 - 175 (1992)) ' r«Us the bighe* resistance are 
^ ^ in selective medium -* £ , totegrat ed into a 

sd e*ed. *"* W -*1*«0*— 

. rvrinese hamster ovary (Oriu; 
chromosome. Chinese 

production of proteins. ^ contain the strong promoter (LTR) 

The expressionvectorspCl „ molog y, «S- 

o^RousSarcomaVirusCCuUen^.^^^^,, Ce „ 

447 (March, 1985)) plus a fragment of ^ eozsroe 

,,,,1-530 a9S5)). Multiple ^ cloni ng of the gene of 

Agression in COS Cells 
Examp le3(a):aonm^ Ex P reSSW 

„™r, HA is made by cloning a cDNA 
The egression plasmid, pBCS ^ ^ (which 

^gBCSGlmtomeexpresstonvectorp 

^heobtameafromlnv^oger,^ ^ _ £ ^ otigta of 

The expression vector pcDNAVamp ^ ffl „ 

plication effective ^^^^ pro^aryotic celU; 

Ip^Ulin resist gene for eeUs; (4) a CMV 

J anS V40o^o,replicationforpro^uo ^ ^ , 

5 plmoter, a ^ - ^tate purification) followed by 

atermmationcodonandpolyadenylation gn 
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. nf the CMV promoter and operably 
Uriced to the SV40 intron and the p* jr tom ^ 

of the recombinant proteuiwi Weneomyc in marker. 

^M*m— csst--- into *. 

A DNA fragment encoding the o . n is airected by the 

eOKAoftnedepo^aonets^™ ^ ^ rf vectors fM 
action sites, much - ^ ^ foUowing , whi ch 

a Kozak sequence, an AUG start ^ 5 

^ complete BCSG1 _ iifiG3 , (SEQ IDNO-.T» (Kozak sequence 
0«^-cATGTmCA^AAOG3 *Q 

■"—^'ZZ .tSon.codmgsequenccKasto—g 
Bamffi site, a stop codon, and P ^ end) . 5 

• • 1 U 6 ° ° ' ( " ' ' eetoetalgggtaCTAGTCTCCCCCACTCT 

GGO^TCAgaaagcg.gtctgggac^^ eiowereaseletters) . 

«, CSBQ--^-^ J vector , - 
The PCR amplified DMA tragm ja^e is trar* formed into 

^^thBanM andthenU^ ~ - ystaM> U099 North 
30 fragment. 
For expression of recombinant BCSG1, COS cells are transfected with an 
expression vector, as described above, [...] for 
instance, in Sambrook et al., Molecular Cloning: a Laboratory Manual, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989). Cells are 
incubated under conditions for expression of BCSG1 by the vector. 

Expression of the BCSG1-HA fusion protein is detected by radiolabeling 
and immunoprecipitation, using methods described in, for example, Harlow et al., 
Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (1988). To this end, two days after 
transfection, the cells are labeled by incubation in media containing 35S-cysteine 
for 8 hours. The cells and the media are collected, and the cells are washed and 
lysed with detergent-containing RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% 
SDS, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. cited 
above. Proteins are precipitated from the cell lysate and from the culture media 
using an HA-specific monoclonal antibody. The precipitated proteins then are 
analyzed by SDS-PAGE and autoradiography. An expression product of the 
expected size is seen in the cell lysate, which is not seen in negative controls. 
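
The RIPA buffer composition quoted above (150 mM NaCl, 1% NP-40, 0.1% SDS, 0.5% DOC, 50 mM TRIS) lends itself to a small dilution calculation using C1·V1 = C2·V2. The sketch below is an illustration only: the stock concentrations, the batch volume, and all names are assumptions chosen for the example and do not come from the source.

```python
# Minimal sketch: stock volumes needed for a batch of RIPA buffer.
# Final concentrations are those quoted in the text; the stock
# concentrations are hypothetical and must match your own reagents.

# name: (final concentration, stock concentration), same unit within a pair
RIPA_COMPONENTS = {
    "NaCl": (150.0, 5000.0),         # mM final, assumed 5 M stock
    "NP-40": (1.0, 10.0),            # % final, assumed 10% stock
    "SDS": (0.1, 10.0),              # % final, assumed 10% stock
    "DOC": (0.5, 10.0),              # % final, assumed 10% stock
    "TRIS pH 7.5": (50.0, 1000.0),   # mM final, assumed 1 M stock
}


def stock_volume_ml(final_conc: float, stock_conc: float,
                    final_volume_ml: float) -> float:
    """Solve C1*V1 = C2*V2 for the stock volume V1."""
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and more concentrated than final")
    return final_conc * final_volume_ml / stock_conc


def ripa_recipe(final_volume_ml: float) -> dict:
    """Volumes (ml) of each stock, plus water to reach the final volume."""
    volumes = {
        name: stock_volume_ml(final, stock, final_volume_ml)
        for name, (final, stock) in RIPA_COMPONENTS.items()
    }
    volumes["water"] = final_volume_ml - sum(volumes.values())
    return volumes


for name, volume in ripa_recipe(100.0).items():
    print(f"{name}: {volume:.1f} ml")
```

Swapping in different stock concentrations only requires editing the table at the top; the arithmetic is unchanged.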

Example 3(b): Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of BCSG1 protein. Plasmid 
pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37146). The 
plasmid contains the mouse DHFR gene under control of the SV40 early 
promoter. Chinese hamster ovary or other cells lacking dihydrofolate activity 
that are transfected with these plasmids can be selected by growing the cells in a 
selective medium (alpha minus MEM, Life Technologies) supplemented with the 
chemotherapeutic agent methotrexate. The amplification of the DHFR genes in 
cells resistant to methotrexate (MTX) has been well documented (see, e.g., Alt, 
F. W., Kellems, R. M., Bertino, J. R., and Schimke, R. T., 1978, J. Biol. Chem. 
253:1357-1370, Hamlin, J. L. and Ma, C. 1990, Biochem. et Biophys. Acta. 
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DHFRgene. Ifasecondg ^ approacb w be 

^pUfied and over-expressed. It is *no ^ ^ ^ 

cl,romosome<s)of<he host cell. _ 

Plasmid pC4 contains for expressing the gene 01 
Plasma p . , .„ ^ ofthe Rous Sarcoma Vuus (Cullen, 
^oter of the longtermmal repea ^ pltts . fragment 

e, o; ., W o^WCe«^B.^. March 1985.43^ 7,^ 

isolated from the enhancer of the u— - * 

i ™ (rMVl (Boshart et ol, Cei/ 4J:521-530 
^megalovrus (CMV) C* ^ 

o^epromo^areBarnm^^ ^ ^ ^ ^ 

1 ^aUon site of the rat prep— gene. 

r ^ — ta - - = - ^ 

the SV40 early or late promoters or the long ««— 
human p-acnn promoter, the SV40 * clontech's Tet-Off and 

^ from other rettoviruses, e.g„ MV and HILVL 

BCSG1 mareg U la^ W aymmammaUancells(C3ossen,M, & Bu J ard 
BCSGlm gu. 551) . Fotm epolyadenylauonofthe 

M S«. «M S9 _ ^normoneorglobingenescan 
mKNA other signals, e.g., from the human gro ^ 
b.usedas.e.1. StableceU lines carrymg ^« ^ ^ 
^niosomescanalso.sele^upon^^ ^ _ 

such as gpt, G418 or hygromycm. It ' ^ 
selectable marker in the begummg, e.g., G418 plus m 
The DNA sequence encoding the BCSG1 protein [...] 

The 5' primer has the sequence 5' 
[...]ccaccATGTTTTCAAGAAGG 3' (SEQ ID NO:7) (Kozak sequence 
[...] in the lowercase letters) containing the underlined BamHI 
[...] 
in eukaryotes, as described by Kozak, M., J. Mol. Biol. 196:947-950 (1987), and 
[...] bases of the coding sequence of BCSG1 shown in Figure 1 (SEQ ID NO:1). 
The 3' primer has the sequence 5' 
GGGGTACCTCACTAGTCTCCCCCACTCTGG 3' (SEQ ID NO:9) containing [...] 
to the non-translated region of the BCSG1 gene shown in Figure 1 (SEQ ID 
NO:1). 
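
The structure of the 3' primer quoted above (a restriction site followed by nucleotides complementary to the end of the BCSG1 sequence) can be checked with a few lines of code. This is a sketch under stated assumptions: the primer string is taken as printed for SEQ ID NO:9, the GGTACC motif is the Asp718/KpnI recognition sequence, treating the last 19 nucleotides as the annealing region follows the "19 nucleotides complementary" wording used for the 3' primer in Example 1, and the function and variable names are hypothetical.

```python
# Minimal sketch: inspect a 3' PCR primer for its restriction site and
# compute the coding-strand sequence its annealing region pairs with.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


PRIMER_3 = "GGGGTACCTCACTAGTCTCCCCCACTCTGG"  # as printed for SEQ ID NO:9
ASP718_SITE = "GGTACC"                        # Asp718/KpnI recognition sequence
ANNEAL_LENGTH = 19                            # assumed annealing region length

site_start = PRIMER_3.find(ASP718_SITE)       # 0-based index, -1 if absent
annealing = PRIMER_3[-ANNEAL_LENGTH:]
target = reverse_complement(annealing)        # sequence on the coding strand

print("site found at index:", site_start)
print("annealing region:   ", annealing)
print("coding-strand match:", target)
```

The computed coding-strand match ends in GAC TAG, which agrees with the Gly Asp C-terminus followed by a TAG stop codon that can still be read at the end of the SEQ ID NO:1 listing.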

The amplified fragment is digested with the endonucleases 
BamHI/Asp718 and then purified again on a 1% agarose gel. The isolated 
fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 
E. coli HB101 or XL-1 Blue cells are then transformed, and bacteria are identified 

the plasmid pSV2-neo using Infect* (Feigner , A W £ 
psleocontains adon^t^.ernaHcer.u.neogenefromTnS^g 

eeUsareseene.malpb.rninusMEMsupp.emen^^ 1 -^J^ 

Germany) in alpha minus MEM supplemented ™<h 10, 25, or ng/ 
methotrexate plus [...]. After about 10-14 days single clones are 
[...] dishes or 10 ml flasks using different 
concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
[...] are then transferred 
to new 6-well plates [...] 
(1 µM, 2 µM, 5 µM, 10 mM, 20 mM). The same procedure is repeated [...] 
or by reverse phase HPLC analysis. 

• ,i out to examine BCSG1 gene 
Horfcem b,t ^J^l^.ase^norn tissues 

expre ssion inhumantissues as follows- P . * Saccni, 

^gto^nret.odof^c^Sacc^ ^ ^ ^ _ 

K, >««■«• Bioctem. J«:156- \ _^ oima ldehyde and transferred 
„horesed in a 1 2% agarose gel — S ^ 
tonylonmembraneCBoel^gerMan^enn) _ 6? . c for 3 0 minThe 

^ExpressHybhybridizationsolunonCCl n«*to O m ^ 

p . 5 x 10 < cpm/ml) for 1 hour *^ ^^li- 
SSC confining 0.05% SDS uu« tunes 3 ^ ^ ^ ^ ^ 

by wo washes with 0, , _ ^ from the Bluest 

full-length BCSG1 cDNA (SEQ ID «>•» . template for 

vector, following EcoKI and °* j^rfoe Random primer DN A labeling kit 

purchased from Amersham. ^ expressed as the 

The northern blot showed that d ^ 

• ♦ ;« Wn which is a rich source ior ^ 
expected 1 kb transcript in brain v/ru 
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tissues: ovary, testis, colon, and heart 

Example 5: Cloning of BCSG1 from cDNA Libraries 

[...] breast cancer versus normal breast. A data base containing approximately 
[...] human partial cDNA sequences (expressed sequence tags) has been 
[...] 
and Human Genome Sciences Inc., using high throughput [...] 
sequence analysis of randomly selected human cDNA clones (Adams, M.D., [...]). 

[...] matched normal breast were isolated and subjected to preparation of 
cDNA libraries. EST automated DNA sequence analysis was performed on 
[...] cDNA clones. Both libraries had about [...] novel gene [...]. 

3048 ESTs from breast cancer cDNA library and 2886 [...] 
cDNA library were randomly picked and sequenced. [...] 
overlapping sequences were grouped into unique EST groups. [...] 
group may represent a gene or a family of sequence-related genes. There were 
[...] EST groups that were analyzed for quantitative comparison of 
[...] expression of individual EST sequences. The numbers of EST 
[...] reflect the relative expression of mRNA transcript [...] 
of the EST. This direct differential cDNA sequencing, as illustrated in 
[...] 
expression profile of individual genes and patterns of genes in normal breast 
versus breast cancer. 
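
Because the two libraries were sampled to different depths (3048 and 2886 ESTs), raw EST counts are best compared after dividing by library size. The sketch below shows one simple way to do that; the function names are assumptions, the example counts (33 and 9, and 4 and 0) are those listed for two genes in this document's EST table, and no statistical test is implied. Ratios built from small counts are noisy and should be read as rough indicators only.

```python
# Minimal sketch: library-size-normalized EST abundance, cancer vs normal.
import math

CANCER_LIBRARY_SIZE = 3048  # ESTs sequenced from the breast cancer library
NORMAL_LIBRARY_SIZE = 2886  # ESTs sequenced from the normal breast library


def est_frequency(count: int, library_size: int) -> float:
    """Fraction of a library's ESTs that belong to one EST group."""
    if library_size <= 0 or count < 0:
        raise ValueError("library size must be positive and count non-negative")
    return count / library_size


def cancer_to_normal_ratio(cancer_count: int, normal_count: int) -> float:
    """Normalized abundance ratio; infinite if the group is absent in normal."""
    cancer = est_frequency(cancer_count, CANCER_LIBRARY_SIZE)
    normal = est_frequency(normal_count, NORMAL_LIBRARY_SIZE)
    if normal == 0:
        return math.inf if cancer > 0 else math.nan
    return cancer / normal


print(round(cancer_to_normal_ratio(33, 9), 2))  # a gene with 33 vs 9 ESTs
print(cancer_to_normal_ratio(4, 0))             # a gene absent from normal
```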
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Results 

onse^neeoverUppn. ^l^^^ve^^ca^ 
relat ive express** m the cDNA ^ ^ rf 

and then subject to assue-specrfic expression y 

individua! EST fences agafcs. a .arge -^^1^ « 

were dually ^ - ^^^wrfto.*-.- 

demonstration of to approaeh, Table ' \ *»» p breast ^ 

ge nes M are drfferentiaUy expressed » normal breast " 

degrading proteinase (Roehefort, H., er A J. Cell 

sr. r-rrr^-, b 

1 8^6-24 (1991); Capony, F. - A ««*~ 

Ccmmm J 7J -.972-80 (19W» 
^caneerprogr^onCBroume,,^,^ 

, r / 7 Mil 5-8 (1989); Rochefort, H., ei •> 

*— t ^ - — — ° DNA 

"^•"TrVBstrrrorb^c^ubrary. - 

library and only 1 EST in the norm 

67 kDa lamininreceptor (Horan-Hand, P., et al., c 

Hunt, G. Exp. Ce« Bio, 5^:165-76 (1989); Castronovo, V., , A ^ 
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to to cuss by «he ^^^^^.baninbrcas. cancer, 
represents genes that are more abun m 
Although the genes m classes ^ ^.^ ^ brMSt 

^ breast versus breast ^ ^ expre5sed * 

Ussue, Cass HI is a spectal group* g ^ ^ 

br eastrela»ve,oo,her n ssue types. 50 P 0 000 £STs using the BLAST 

ge ne was searched against app— * ( (199 0),. None of 

program (Altschul, BI,*A ^ ' ^ one ^edwith any 

.hesebreastcancerspecificgeneslB BCSG , was chosen for analysis 

fences in pubhc gene „ to sequence has been 

as a first P-f breasl — * database; and 2) most of the 

individualESTsequencesmBCSG SCTenof them were discovered tn 

Of the eight distinctive EST clones » BO. ^ ^ £ST dones 

taeas, o0NA libraries and only one tn b ^ ' ^ ideIltified m the breast 
^eredm.ebreastcn.Alibran^^^ 

^nor library and only one » ^.h-.rt* 

sequencing of ail 6 EST clones, one ^^^^ 0 f the resulting full-lenglh gene is 
^engthsequence. Theopenre^mg ^ opt ^ alignment, the 

pasted to encode a 127 ammo actd polyp P ^ ^ ^ 

^veBCSOl^ded^^- ^ ^ ^ ^in 

C Ueda,K.,et 0 f..-P'- 0 c.W<«/.^ al t 
Genes more abundant in breast cancer 

                                      EST 
Genes                           Cancer    Normal 

Breast basic conserved gene       33         9 
Cathepsin D                        5         1 
67 kDa laminin receptor            4         0 
Elongation factor 1               13         5 


Genes 



Cancer 



EST 



Normal 



30 



Genes 



NB1 



1 
0 
0 
4 
0 



EST 
BC 2 

6 
7 
5 
0 
4 



All Tissues 



8 
7 
5 
4 
4 



BCSG1 
BCSG2 
BCSG3 
BCSG4 

ctaee III breast carcinoma 
v*h more ft- 3 h~WP«* ^ h ^ lbt 
Example 6: Expression of BCSG1 in Human Breast Cancer Cells 

In an attempt to evaluate the potential biological significance of BCSG1 
[...] 
isolation kit RNAzol B (Tel-Test, Inc.) [...]. 
Equal aliquots of RNA were [...]. 

[...] membrane was pre-hybridized with ExpressHyb hybridization [...] 
at 68°C for 30 min. The hybridization was carried out in the same [...] 
with 32P-labeled BCSG1 probe (1.5 x 10[...] cpm/ml) for 1 hour at 68°C. The 
membrane was then rinsed in 2 x SSC containing 0.05% SDS three times for 30 
min at room temperature, followed by two washes with 0.1 x SSC containing 
0.1% SDS for 40 min at 50°C. The full-length BCSG1 cDNA (SEQ ID NO:1) 
was isolated from the Bluescript vector, following [...] digestion, and 
used as a template for preparation of a random-labelled probe. Random 
primer DNA labelling kit was obtained from Boehringer Mannheim. 
32P-dATP was purchased from Amersham. 
' legion of BCSOl ■» — ^ «~ f ^ 

^rip. in breast carcinoma. ^— ^inhuman 
undete^lembenignbreast^^^-o ^ ^ 

breasttissue and its overexpresstonm breast car iblero i eor 

aifferential cDNA sequencing cloning strategy which «W~ cancer. 

i RCSG1 in the development of breast cancer. 

abiomarkerofup-regulat^onofBCSGlm .. avarietyofnuma n breast 

• ^fxar^ni was also examined in a variety oi 
The expression of BCSU 1 was <u H3630, 

H3680B; pleura, elision derived ceU lines MC 7^ 

MDAMB23 1 ; infiltrating ductal carcinoma derived cell hnesH3914, H3 
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f TA7n ZR-75-1 SKBR-3,MCF-7andMDA-MB-231 
7J.1. T47D. Cell Unes ofT47D, 7K 75 1 Bristol . Mv6ts Squibb 

„ from ATCC; all other lines were unttaUy -solated 

Nor^nblotdefcctedthe 1 md .» 4 M „ Unes 

derived from pleural efluston (t.e., SKBR UO ^ 

a— ^ — ^bcsoi^. Among 

^r^ptltZe.levelo.BCSO,^ -absence 
tee Unes,H3922 expresse ^ ^ ^ 

ot BCSOl mRNA in some pleural effuston den ^ ^ ^ 

expression of BCSG1 gene may require sr*cfic » «vo con 

^ 7.- J, «. Hy^nofBCSOX in »reas, cancer ceHs 
infutrating carcinomas, 15 msim 

hyperplasia and fibroadenoma). r & RC 

^-^;TTirodtdB. D .Hames(ed.,^nd„n.. 

LRL Press.,(1992) >P p. 15-32). Bnefly d p hybridized 
(5 .um thick) were treated with protetnase K, pre nyb 

— ^ ^ -probe 

« general by M * , ^ ^ ^ent 

po.ymeras, H^onwasfoU^^ ^ 

^.Ito^ Sections were mcubated wtth m ndaryrabW t 

CBo^^by.— „ _ ^ 

anti-mouse antibodies (DAKO). Thee 
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UmversalUSABKltaccordmg^a exaM ned: 
Inmesee*per».en*, W o^ ^ m ,^ me co— of 

„ the tissue ^J^Jant phenol A — * 

^omas was observed. The exp ^ ^ No 

4. neoplastic epical cells » « ^ to oontrast, expressron 

of BCSOl was absent » aU 15 ca*s ^ ^ proUferalive breast 

» «*» " ^ normal dud breast epical cUs are 

pr esen*d. Furthermore, in ^ norma l lobular breast 

signal ofBCSGl ^"^'^We breast carcinoma cells were 
epit helial ceUs although the surrountog m ^ 
sLnedposUiveforBCSOle^J^ ^ a ^expression of 

B is interesting * «>«e that aUh g ^ ^ 

^^ematignantbreastep^^^^^^. 

iB sin , carcinomas, 9 specunens we« s«n^ ^ ^ rf ^ ^ 
^stained. Both.hemtens.tv of s^^ Ulnomas compared «>«he strong 

det ectable expression In normal - *J ^ M expression 

^^^CrT^^ana^clatio nofBCSO! 
totoinffltrattogmaUgnantbreast 
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■ ^ breast caacer malignant progression. Based on .his BCSG1 

It will be clear that the invention may be practiced otherwise than as
particularly described in the foregoing description and examples.
Numerous modifications and variations of the present invention are
possible in light of the above teachings and, therefore, are within the scope of the
appended claims.

The entire disclosure of all publications (including patents, patent
applications, journal articles, laboratory manuals, books, or other documents)
cited herein are hereby incorporated by reference.
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Ji, Hongjun; Rosen, Craig A.

(ii) TITLE OF INVENTION: Breast Cancer Specific Gene 1

NUMBER OF SEQUENCES : H 
(iv) CORRESPONDENCE ADDRESS: Goldstein & Fox P.L.L.C.

(C) CITY: Washington
(D) STATE: DC
(E) COUNTRY: USA
(F) ZIP: 20005-3934

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy

(C) CLASSIFICATION:

- p fi^P- 08/613,281 

(S) CLASSIFICATION: 
S CLASSIFICATION: 

(2) INFORMATION FOR SEQ ID NO:1:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 12..392

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

_ m TP T rr ATC GCC AAG AAG 

CACGAGCCAC C ATG GAT GTT TTC AAG AAG « ™ ™ „. M . Lys , ys 



1 5 



s s a a a !S a a s a a ss ss a a a 

15 20 

r»r rrr GTC ATG TAT GTG GGA GCC AAG ACC AAG 

ST. SS IS SS SS ?S S * ™ «y »*■ T» 

30 35 

SS S SS a SS S2 a S E S3 S SS a S t£ SS 
SS S K S SS a 5! E SS SS S a SS SS S !S 

G cc R cc I « «» «o s - « ?r. a ss a a ss 

Ala Thr Lys Thr Val Glu Glu axa 9Q 
80 a3 

ss ss a ss a a a a a s a a a a ss a 

ss a a a a a ss a ss a ss ss a ss a i 

GAC TAGAGGGCTA <^L»* TGGATGACCT GAAGAGCGCT CCTCTGCCTT 

Gly Asp 

GGACACCATC CCCTCCTAGC ACAAGGAGTG CCCGCCTTGA GTGACATGCG GGTGCCCACG 

CTCCTGCCCT CGTCTCCCTG GACACCCTTG GCCTGTCCAC CTGTGCTG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear



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(ii) MOLECULE TYPE: protein 
Bet « P V,! P M * ~ 'f. " a 

v,! Glv Ala Lys Thr Lys Glu Asn Val 
Th r Lys Glu Gly Val Met Tyr Val Gly Ala 

35 c val Ala Glu Lys Thr Lys Glu Gin Ala Asn 

val G ln Ser Val Thr Ser Val Ala Gl y g0 

50 t s Ala Val Val Ser Ser Val Asn Thr Val Ala Thr Lys 

A la Val Ser Lys Ala vai 15 

65 „ i ser Gly Val Val Arg 

Thr vaX Glu Glu Glu Asn lie Ala Thr Ser 

„ ,eu Arg Pro Ser AX. Pro Gin Gin Glu Gly Glu Al, Ser 

Lys Glu Asp Leu Arg tr*v 1Q5 

^ Glu I Glu VI Ale Glu Glu Ale Gin Ser gj Glv - 

115 

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGGATCCAT GTTTTCAAGA AGG 
(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGAAGCTTCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO:5: 
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... SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGGATCCCG ATGTTTTCAA GAAGG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGTACCCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
GGGGATCCGC CACCATGTTT TCAAGAAGG 
(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
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AGAAAGCGTA GTCTGGGACG TCGTATGGGT ACTAGTCTCC 



60 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

30 

GGGGTACCTC ACTAGTCTCC CCCACTCTGG 
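
The declared primer lengths can be cross-checked against the listed bases. The following is a minimal sketch, assuming the primer text of SEQ ID NOs: 3, 6, 7 and 9 as printed above; the helper names are hypothetical, and the recognition sites shown (BamHI GGATCC, HindIII AAGCTT, KpnI GGTACC) are the standard ones for those enzymes rather than anything stated in this listing.

```python
# Primers as printed in the sequence listing, with their declared lengths.
PRIMERS = {
    3: ("GGGGATCCAT GTTTTCAAGA AGG", 23),
    6: ("GGGGTACCCT AGTCTCCCCC ACTCTGG", 27),
    7: ("GGGGATCCGC CACCATGTTT TCAAGAAGG", 29),
    9: ("GGGGTACCTC ACTAGTCTCC CCCACTCTGG", 30),
}

# Standard recognition sites (an assumption about which enzymes are relevant).
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT", "KpnI": "GGTACC"}


def normalize(seq: str) -> str:
    """Drop the spacing between blocks of ten bases and upper-case the result."""
    return seq.replace(" ", "").upper()


def check_primer(seq: str, declared_length: int):
    """Return (length matches declaration, names of recognition sites found)."""
    bases = normalize(seq)
    found = [name for name, site in SITES.items() if site in bases]
    return len(bases) == declared_length, found


for seq_id, (seq, declared) in PRIMERS.items():
    length_ok, sites = check_primer(seq, declared)
    print(f"SEQ ID NO:{seq_id}: length ok={length_ok}, sites={sites}")
```

A mismatch between the counted and declared length would point to a transcription error in the listing rather than to anything about the primer itself.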
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



CGGACGCCTT TTTTTTGAAT TCGTAATCAT 60
ATCCGCTCAC AATTCCACAC AACATACGAG 120
CCTAATGAGT GAGCTAACTC ACATTAATTG 180
GAAACCTGTC GTGCCAGCTG CATTAATGAA 240
GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 300
GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 360
ACGCAGGAAA GAACATGTGA GCAAAAGGCC 420
CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 480
CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 540
TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 600
TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 660
GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 720
ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 780
ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 840



,00 



GAGTTCT TCAAGTGG.G ^« GGCWACTO g 

cgaggtatgt aggcggtgct acagagttc toottcgga 

GAAGAACAG, — GG — GC 

GTAGCTCTTG A.CCGGCAAA ° ^ TCTRCGGGGT 

.GCAGATTAC GCGCAGAAAA „ GGTCRT G R G, ^CGA 

CTGACGCTCA GTGGAACGAA *^ RCRCCMC5 A ATGGTGCAAA 

^CGCGC GCGAAGGCGA AG« tc wttcagggt gg tgAATGTG 

ACCTXX CGCG G^GGCA.S — *~ tMGCCGGTG TCTCTTATCA GACCGTTTCC 
„AA CGTTATACGA --AGA * ^GCGGCG 

cgcgtggtga accaggccag ccacg^c wctggcggs <aMCMIC , 

ATGGCGGAGC TGAAT.ACA, TCCC^ CGCCGTCGCA AATTGTCGCG 
TTGCTGATTG GCGT.GCCAC CXCCAG^ OC : 

gcgattaaat ccgcgccga .caac^ tcgcgcwcg cGiraGTCGG 

agcggcgtcg aagcc T g T aa agcggcgg- «c» ctctggwgc tgcctgcrct 

CTGATCATTA AC^CCGC. GGA--G ^ 

AATGTTCCGG CGTTATTTCT — - tcGCATTGGG TCACCAGCAA 

tcccatgaag acgg»cgcg ---;; t -: tcggcgc GTCT GCG TCT 
atcgcgctgt tagcgggccc attaagt- ^ gggwgg cgrctggmi 

CATAAATATC TCAC.CGCAA TCAAA «** COG* 

gcca TCT ccg g^caaca aacc-c- gcgccattrc cgrgtccggg 

ATGCTGGTTG CCAACGA.CA GA.GGCGCTC ^ CAGCTCATGT 

cTO CGCG TTO 0— r^CGCC TGCTGGGGCA AACCAGCG.G 

tatatcccgc cg TI aaccac ca^* - GTTGC CCGXC 

GACCGCTTGC TGCAACTCTC TCAGGGCCAG 

TCACTGGTGA AAAGAAAAAC CACCCTGGCG CCCAA cGGGCAGTGA 

TTGGCCGATT CATTAAXGCA GC.GGC-- TCGTGCCTCC 
GCGCAACGCA ATTAATGTAA GTTAGCGCGA 

CGACCCGG AGTTCGGGGG CA,GGA T « ^ 

GACGGA^O CACGCCGG. ^ gA.CCCCGCG 
CCAACTCGCG AGGGGATCGA GCCCGGG-G ^ TTTCATAGAA 

cggaggaxc atccagccgg — G ^ g ^ gttgggc gtcgOT gg T cggxca^c 

GGCGGCGGTG GAATCGAAAT ^ « GATGCGCTGC 

^ccccaga gtcccgctca GAAGAACCG tcaa 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



rRGCCCATTC GCCGCCAAGC 282 

gaatcgggag c— «= >z:;z — — : « 

TCTIcM CAA -CC»CO« A ccrtgmrtt CGGCWGCRS - 

cggccacag, cgaxgaa.cc agaaaag- - xgcgcgcctt gagccggcg 3 

GCATCGCCA. GGGTCACGAC GAGA.CC^ - MCG ACAAGA * 

Iacagttcgg ctggcgcgag cccc^oc T « tcgcttggtg gt 

CCGGCT.CCA TCCGAGTACG ^CGC.CG AT 

icTAGCCG GATCAAGCGT "--^ C ^ TGCCCCG gCACTTCGCC CAATAGCAGC • 

tcggcaggag caaggtgaga tgacagga» - C eGCRRGGW c GCCCGTCGTG 

1^ CCGCTTCAGT ^ TCAGGGCAGC GGACAGGTCG 

GCCAGCCACG ATAGCCGCGC TGCCT^C » GGcMC AGAG 
■ gTCTTGACAA AAAGAACCGG GCGCCCCTGC O 

CAGCCGATTG --^^g ^ TGTCTCTTGA 

^GT GCAATCCATC TTGTTC»TC * ccAGTTTACT 
TCAGATCTTG ATCCCCTGCG CCAT^- „ 

„gcagggct tcccaacctt -^^^ cactgcaagc tacctgcttt 
^ — GTC "ag atagcccagt agctgacatt catccggggt 

CTCTTTGCGC TTGCGTTTTC * tCCTTTAGCA GCCCTTGCGC 

cagcaccgtt tctgcggact ggc^c- ^ ^ trsttt ^ 

CCTGAGTGCT TGCGGCAGCO - jyjCGGATAAC AATTTCACAC AT— 

cggataacaa ttaagatgta cccaa 
agaaattaca tatg 

(ii) MOLECULE TYPE: cDNA
AAGCTTAAAA AA— TRMGRGGRG ^TACATA TG 

caattgtgag cggataacaa tttca 



3660
3720
3780
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3900 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide

amino acid sequence encoded by the cDNA clones contained in ATCC Deposit
No. 97175 or 97856; and

(d) a nucleotide sequence complementary to any of the
nucleotide sequences in (a), (b) or (c).

2. An isolated nucleic acid molecule comprising a polynucleotide
which encodes the amino acid sequence of an epitope-bearing portion of a
BCSG1 polypeptide having an amino acid sequence in (a), (b) or (c) of claim 1.

3 Tto isolated nucleic acid molecule of claim 7, which encodes an 

„f . RCSQ1 polypeptide selected fan the group 
cpitope-bearing portion of a BCSG1 po yp P ^ 
of- a oolypeptide comprising ammo acid residues nom 

«"""-"». ct ' P °7' roww andapolypeptidecomprismg amino actd 
aboutl07inFigurel(SEQlDNU.^,an f ,r 

resid ues from about HOtoabou, 127 inFigure 1 (SEQIDNO.2). 

4. An isolated nucleic acid molecule, comprising a polynucleotide
having a sequence selected from the group consisting of:
(a) the nucleotide sequence of a fragment of the sequence

nucleotides of SEQ ID NO:1; and

(b) a nucleotide sequence complementary to a nucleotide

sequence in (a).

5. A method for making a recombinant vector comprising inserting
an isolated nucleic acid molecule of claim 1 into a vector.

6. A recombinant vector produced by the method of claim 5. 

7. A method of making a recombinant host cell comprising
introducing the recombinant vector of claim 6 into a host cell. 

8. A recombinant host cell produced by the method of claim 7.

9. A recombinant method for producing any of the BCSG1

conditions such that said polypeptide is expressed
polypeptide. 

10. An isolated BCSG1 polypeptide having an amino acid sequence
at least 95% identical to a sequence selected from the group consisting of:

(a) amino acids from about 1 to about 127 in SEQ ID NO:2;
(b) amino acids from about 2 to about 127 in SEQ ID NO:2;
(c) the amino acid sequence of the BCSG1 polypeptide having
the amino acid sequence encoded by the cDNA clones contained in ATCC

any one of the polypeptides of (a), (b) or (c).
12. An isolated antibody that binds specifically to a BCSG1
polypeptide of claim 10.

acid substitutions, said polypept.de has asequ 
oonsistingof: ^ ^ ^ . polypeptide comprising 

^^^^^^^ 

^o.ids^uLcncodedbytbecm.c.onecon^edtn.TCCOep 

^•""^'X'tnuCeotide seance complementary » - of *. 
nucleotide sequences in (a), (b) or (c). 

• , , A BCSG1 polypeptide wherein, except for at least one 
w aminoacidsftomabout2«oabou.l27mSEQ 
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(c) ^arninoacidse^enceoftheBCSGl ******** 
^bvAecDNAclon^contmnrfinATCCDepos.t 
,1* amino add sequence eacodedby the cDNA 

^'"""^te^ac^uenceo^epi.pe-^ponionof 

any one of the polypeptides of (a), (b) or (c).

15. A method for breast tumor diagnosis in an individual, comprising
assaying the expression level of the gene encoding the BCSG1 protein in cells or
body fluid of the individual and comparing the gene expression level with a
standard BCSG1 gene expression level, whereby an
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Breast Cancer Specific Gene 1 
Abstract 



, « CS G1 protein. In particular, 

BCSG> po.^ are ^ o *« 

m efrods for producing the same. Also pr 



detecting breast cancer. 






Figure 1 



(Nucleotide sequence and deduced amino acid sequence of BCSG1; see SEQ ID NO:1 and SEQ ID NO:2.)



Figure 2 





FIGURE 3 
16. (previously presented) An isolated polynucleotide comprising a nucleic acid
encoding amino acids 2 to 127 of SEQ ID NO:2. 

17. (previously presented) The isolated polynucleotide of claim 16, comprising
nucleotides 15 to 392 of SEQ ID NO:1.

18. (previously presented) The isolated polynucleotide of claim 16, comprising a
nucleic acid encoding amino acids 1 to 127 of SEQ ID NO:2.

19. (previously presented) The isolated polynucleotide of claim 18, comprising
nucleotides 12 to 392 of SEQ ID NO:1.
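
The nucleotide and amino acid ranges recited in claims 16 to 19 can be checked against each other with simple arithmetic. The sketch below assumes only that the coding region begins at nucleotide 12 of SEQ ID NO:1 and is read in frame; the function name `codon_count` is illustrative and does not come from the application.

```python
def codon_count(first_nt: int, last_nt: int) -> int:
    """Number of complete codons in a 1-based, inclusive nucleotide range."""
    length = last_nt - first_nt + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("range is empty or not a whole number of codons")
    return length // 3


# Nucleotides 12 to 392 (claim 19) span 381 nucleotides, i.e. 127 codons,
# which matches amino acids 1 to 127 of SEQ ID NO:2 (claim 18).
full_length = codon_count(12, 392)

# Nucleotides 15 to 392 (claim 17) omit the first codon, leaving 126 codons,
# which matches amino acids 2 to 127 of SEQ ID NO:2 (claim 16).
without_first_residue = codon_count(15, 392)

print(full_length, without_first_residue)
```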

20. (previously presented) The isolated polynucleotide of claim 16, which is DNA.

21. (previously presented) The isolated polynucleotide of claim 16, which is RNA.

22. (previously presented) The isolated polynucleotide of claim 16, further 
comprising a heterologous polynucleotide. 

23. (previously presented) The isolated polynucleotide of claim 22, wherein said
heterologous polynucleotide encodes a heterologous polypeptide. 

24. (previously presented) A method of producing a vector that comprises 
inserting the isolated polynucleotide of claim 16 into a vector. 

25. (previously presented) A vector comprising the isolated polynucleotide of 
claim 16. 



26. (previously presented) The vector of claim 25, wherein said polynucleotide is 
operably associated with a heterologous regulatory sequence. 
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27. (previously presented) A host cell comprising the isolated polynucleotide of 
claim 16. 

28. (previously presented) The host cell of claim 27, wherein said isolated 
polynucleotide is operably associated with a heterologous regulatory sequence. 

29. (previously presented) A method of producing a polypeptide that comprises 
culturing the host cell of claim 28 under conditions such that said polypeptide is expressed, 
and recovering said polypeptide. 

30. (previously presented) A composition comprising the isolated polynucleotide 
of claim 16 and a pharmaceutically acceptable carrier. 

31. (previously presented) An isolated polynucleotide comprising a nucleic acid
encoding the complete amino acid sequence encoded by the cDNA clone of ATCC Deposit No.
97856.

32. (previously presented) The isolated polynucleotide of claim 31, which is DNA.

33. (previously presented) The isolated polynucleotide of claim 31, which is RNA.

34. (previously presented) The isolated polynucleotide of claim 31, further 
comprising a heterologous polynucleotide. 

35. (previously presented) The isolated polynucleotide of claim 34, wherein said 
heterologous polynucleotide encodes a heterologous polypeptide. 

36. (previously presented) A method of producing a vector that comprises inserting 
the isolated polynucleotide of claim 31 into a vector. 

37. (previously presented) A vector comprising the isolated polynucleotide of claim 31.
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38. (previously presented) The vector of claim 37, wherein said polynucleotide is 
operably associated with a heterologous regulatory sequence. 

39. (previously presented) A host cell comprising the isolated polynucleotide of claim 31.

40. (previously presented) The host cell of claim 39, wherein said isolated 
polynucleotide is operably associated with a heterologous regulatory sequence. 

41. (previously presented) A method of producing a polypeptide that comprises 
culturing the host cell of claim 40 under conditions such that said polypeptide is expressed, and 
recovering said polypeptide. 

42. (previously presented) A composition comprising the isolated polypeptide of 
claim 31 and a pharmaceutically acceptable carrier.

43. (canceled) 

44. (previously presented) An isolated polynucleotide fragment of SEQ ID NO:1
consisting of at least 100 contiguous nucleotides of the coding region of SEQ ID NO:1 or the
complement thereof.

45. (previously presented) The isolated polynucleotide fragment of claim 44,
consisting of at least 250 contiguous nucleotides of the coding region of SEQ ID NO:1 or the
complement thereof.
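
The "contiguous nucleotides ... or the complement thereof" language of claims 44 and 45 can be illustrated as a substring test. This is only a sketch under stated assumptions: the coding region used here is a made-up placeholder rather than SEQ ID NO:1, "complement" is read as the reverse complement, and the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string containing only A, C, G and T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_contiguous_fragment(fragment: str, coding_region: str, min_length: int = 100) -> bool:
    """True if the fragment has at least min_length bases and occurs as an
    uninterrupted stretch of the coding region, on either strand."""
    fragment = fragment.upper()
    coding_region = coding_region.upper()
    if len(fragment) < min_length:
        return False
    return fragment in coding_region or reverse_complement(fragment) in coding_region


# Placeholder coding region (NOT the BCSG1 sequence), 240 bases long.
toy_coding_region = "ATGGCC" * 40
toy_fragment = toy_coding_region[30:150]  # 120 contiguous bases

print(is_contiguous_fragment(toy_fragment, toy_coding_region))                      # sense strand
print(is_contiguous_fragment(reverse_complement(toy_fragment), toy_coding_region))  # complement
print(is_contiguous_fragment(toy_fragment, toy_coding_region, min_length=250))      # too short for claim 45
```

Passing `min_length=250` corresponds to the narrower threshold of claim 45.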

46. (previously presented) The isolated polynucleotide of claim 44, which is DNA. 

47. (previously presented) The isolated polynucleotide of claim 44, which is RNA. 
48-49. (canceled) 
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50. (previously presented) A method of producing a vector that comprises inserting 
the isolated polynucleotide of claim 44 into a vector. 

51. (previously presented) A vector comprising the isolated polynucleotide of claim 44.

52. (canceled) 

53. (previously presented) A host cell comprising the isolated polynucleotide of claim 44.

54-56. (canceled) 

57. (previously presented) An isolated polynucleotide, encoding a fragment of SEQ 
ID NO:2 selected from the group consisting of: 

(a) a polypeptide consisting of at least amino acids 94 to 107 of SEQ ID NO:2; and

(b) a polypeptide consisting of at least amino acids 120 to 127 of SEQ ID NO:2.

58. (previously presented) The isolated polynucleotide of claim 57, wherein said 
polypeptide is (a). 

59. (previously presented) The isolated polynucleotide of claim 57, wherein said 
polypeptide is (b). 

60. (previously presented) The isolated polynucleotide of claim 57, which is DNA. 

61. (previously presented) The isolated polynucleotide of claim 57, which is RNA.

62. (previously presented) The isolated polynucleotide of claim 57, further 
comprising a heterologous polynucleotide. 

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63. (previously presented) The isolated polynucleotide of claim 62, wherein said 
heterologous polynucleotide encodes a heterologous polypeptide. 

64. (previously presented) A method of producing a vector that comprises inserting 
the isolated polynucleotide of claim 57 into a vector. 

65. (previously presented) A vector comprising the isolated polynucleotide of claim 57.

66. (previously presented) The vector of claim 65, wherein said polynucleotide is
operably associated with a heterologous regulatory sequence. 

67. (previously presented) A host cell comprising the isolated polynucleotide of claim 57.

68. (previously presented) The host cell of claim 67, wherein said isolated 
polynucleotide is operably associated with a heterologous regulatory sequence. 

69. (previously presented) A method of producing a polypeptide that comprises 
culturing the host cell of claim 68 under conditions such that said polypeptide is expressed, and 
recovering said polypeptide. 

70. (previously presented) A composition comprising the isolated polynucleotide of 
claim 57 and a pharmaceutically acceptable carrier. 

71. (previously presented) An isolated polynucleotide molecule comprising a nucleic
acid 95% or more identical to a reference nucleic acid encoding an amino acid sequence selected 
from the group consisting of: 

(a) amino acids 1 to 127 of SEQ ID NO:2;

(b) amino acids 2 to 127 of SEQ ID NO:2; and 

(c) the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97856; 

wherein said polynucleotide is overexpressed in breast cancer.
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72. (previously presented) An isolated polynucleotide comprising a nucleic acid 
encoding an amino acid sequence, wherein, except for one to thirty conservative amino acid 
substitutions, said amino acid sequence is selected from the group consisting of: 

(a) amino acids 1 to 127 of SEQ ID NO:2; 

(b) amino acids 2 to 127 of SEQ ID NO:2; and 

(c) the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97856; 

wherein said polynucleotide is overexpressed in breast cancer.

73. (previously presented) The isolated polynucleotide of claim 72, wherein the 
number of substitutions is not more than 10. 

74. (previously presented) The isolated polynucleotide of claim 73, wherein the 
number of substitutions is not more than 5. 

75. (previously presented) The isolated polynucleotide of claim 74, wherein the 
number of substitutions is not more than 3. 

76. (previously presented) An isolated polynucleotide comprising a nucleic acid 
encoding an amino acid sequence 95% or more identical to a reference amino acid sequence from 
the group consisting of: 

(a) amino acids 1 to 127 of SEQ ID NO:2; 

(b) amino acids 2 to 127 of SEQ ID NO:2; and 

(c) the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97856; 

wherein said polynucleotide is overexpressed in breast cancer.
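
The "95% or more identical" limitation recited in claims 71, 76 and 77 can be illustrated with a simple calculation. This section does not state how identity is to be computed, so the sketch below makes its own assumptions: the two sequences are already aligned to equal length, gap characters (`-`) count as mismatches, and the denominator is the alignment length. The function names and the example peptide are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned positions holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """True if the identity is at or above the threshold (95% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Made-up 20-residue reference and a variant differing at one position.
reference = "MKTAYIAKQRQISFVKSHFS"
variant = "MKTAYLAKQRQISFVKSHFS"

print(percent_identity(reference, variant))
print(meets_threshold(reference, variant))
```

A production comparison would normally start from an alignment program rather than from pre-aligned strings; the choice of program and parameters changes the resulting percentage.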

77. (previously presented) An isolated polynucleotide comprising a nucleic acid 
which is 95% or more identical to a reference nucleic acid, wherein said reference nucleic acid is 
selected from the group consisting of: 

(a) nucleotides 15 to 392 of SEQ ID NO:1; and

(b) nucleotides 12 to 392 of SEQ ID NO:1;
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and wherein said polynucleotide is overexpressed in breast cancer. 
78-79. (canceled) 
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